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Preface 


This is a book about arithmetic subgroups of semisimple Lie groups, which 
means that we will discuss the group SL(n, Z), and certain of its subgroups. 
By definition, the subject matter combines algebra (groups of matrices) with 
number theory (properties of the integers). However, it also has important 
applications in geometry. In particular, arithmetic groups arise in classical 
differential geometry as the fundamental groups of locally symmetric spaces. 
(See Chapters 1 and 2 for an elaboration of this line of motivation.) They 
also provide important examples and test cases in geometric group theory. 

My intention in this text is to give a fairly gentle introduction to several 
of the main methods and theorems in the subject. There is no attempt to be 
encyclopedic, and proofs are usually only sketched, or only carried out for an 
illustrative special case. Readers with sufficient background will learn much 
more from [Ma] and [PR] (written by the masters) than they can find here. 

The book assumes knowledge of algebra, analysis, and topology that 
might be taught in a first-year graduate course, plus some acquaintance with 
Lie groups. (Appendix A quickly recounts the essential Lie theory, and Ap- 
pendix B lists the required facts from graduate courses.) Some individual 
proofs and examples assume additional background (but may be skipped). 

Generally speaking, the chapters are fairly independent of each other (and 
they all have their own bibliographies), so there is no need to read the book 
linearly. To facilitate making a plan of study, the bottom of each chapter’s 
first page states the main prerequisites that are not in appendices A and B. 
Individual chapters (or, sometimes, sections) could be assigned for reading in a 
course or presented in a seminar. (The book has been released into the public 
domain, so feel free to make copies for such purposes.) Notes at the end of 
each chapter have suggestions for further reading. (Many of the subjects have 
been given book-length treatments.) Several topics (such as amenability and 
Kazhdan’s property (T)) are of interest well beyond the theory of arithmetic 
groups. 

Although this is a long book, some very important topics have been omit- 
ted. In particular, there is almost no discussion of the cohomology of arith- 
metic groups, even though it is a subject with a long history and continues to 
be a very active field. (See the lecture notes of Borel [B2] for a recent survey.) 


PREFACE 


Also, there is no mention at all of automorphic forms. (Recent introductions 
to this subject include [De] and [SS].) 

Among the other books on arithmetic groups, the authoritative mono- 
graphs of Margulis [Ma] and Platonov-Rapinchuk [PR] have already been 
mentioned. They are essential references, but would be difficult reading for 
my intended audience. Some works at a level more comparable to this book 
include: 


[B]] 


[Hu] 


[Ji 


This classic gives an explanation of reduction theory (discussed here in 
Chapter 19) and some of its important consequences. 


This exposition covers reduction theory (at a more elementary level 
than [B1]), adeles, ideles, and fundamentals of the Congruence Sub- 
group Property (mentioned here in Remark 17.1.3(4)). 


This extensive survey touches on many more topics than are covered 
here (or even in [Ma] and |PR]), with 60 pages of references. 


This monograph thoroughly discusses arithmetic subgroups of the groups 
SL(2,R) and SL(2,C). 


This is an essential reference (along with [Ma] and [PR]). It is the 
standard reference for basic properties of lattices in Lie groups (cov- 
ered here in Chapter 4). It also has proofs of the Godement Criterion 
(discussed here in Section 5.3), the existence of both cocompact and 
noncocompact arithmetic subgroups (discussed here in Section 18.7), 
and reduction theory for arithmetic groups of Q-rank one (discussed 
here in Chapter 19). It also includes several topics not covered here, 
such as cohomology vanishing theorems, and lattices in non-semisimple 
Lie groups. 


This textbook provides an elementary introduction to the Congru- 
ence Subgroup Property (which has only a brief mention here in Re- 
mark 17.1.3(4)). 


After developing the necessary prerequisites in ergodic theory and rep- 
resentation theory, this monograph provides proofs of three major theo- 
rems of Margulis: Superrigidity, Arithmeticity, and Normal Subgroups 
(discussed here in Chapters 16 and 17). It also proves a generalization 
of the superrigidity theorem that applies to “Borel cocycles’ 


Dave Morris 
April 2015 


[B]] 


[B2] 


Ji 


Mal 
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Introduction 


Chapter 1 


What is a 
Locally Symmetric Space? 


In this chapter, we give a geometric introduction to the notion of a symmetric 
space or a locally symmetric space, and explain the central role played by 
simple Lie groups and their lattice subgroups. (Since geometers are the target 
audience here, we assume familiarity with differential geometry that will not 
be needed in other parts of the book.) This material is not a prerequisite for 
reading any of the later chapters, except Chapter 2; it is intended to provide a 
geometric motivation for the study of lattices in semisimple Lie groups. Since 
arithmetic subgroups are the primary examples of lattices, this also motivates 
the main topic of the rest of the book. 


§1.1. Symmetric spaces 


Recall that a Riemannian manifold is a smooth manifold M, together with 
the choice of an inner product (: | -), on the tangent space T,,M, for each 
x € M, such that (- | -), varies smoothly as x varies. The nicest Riemannian 
manifolds are homogeneous. This means that every point looks exactly like 
every other point: 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: understanding of geodesics, and other 


concepts of Differential Geometry. 


1. WHAT IS A LOCALLY SYMMETRIC SPACE? 


(1.1.1) Definition. A Riemannian manifold X is a homogeneous space if 
its isometry group Isom(X) acts transitively. That is, for every x,y € X, 
there is an isometry ¢ of X, such that $(x) = y. 


(1.1.2) Notation. We use G° to denote the identity component of the 
group G. 


(1.1.3) Examples. Here are some elementary examples of (simply connected) 
homogeneous spaces. 


1) 


5) 


The round sphere S$” = {x € R"*! | ||z|| = 1}. Rotations are the 
only orientation-preserving isometries of S”, so we have Isom($”)° = 
SO(n + 1). Any point on S$” can be rotated to any other point, so S” 
is homogeneous. 


Euclidean space R”. Every orientation-preserving isometry of R” is 
a combination of a translation and a rotation, and this implies that 
Isom(R”)° = SO(n) k R”. Any point in R” can be translated to any 
other point, so R” is homogeneous. 


The hyperbolic plane 9? = {z € C | Imz > 0}, where the inner 
product on T,9)? is given by 


1 


4(Im z)2 (u | v)R2. 


(u|v)g2= 
It is not difficult to show that 
Isom(§)”)° is isomorphic to PSL(2,R)° = SL(2,R)/{+1}, 
by noting that SL(2,R) acts on $)? by linear-fractional transformations 


z+ (az+b)/(cz4+d), and confirming, by calculation, that these linear- 
fractional transformations preserve the hyperbolic metric. 


Hyperbolic space 9” = {x € R” | x, > 0}, where the inner product 
on T,§)" is given by 


1 
(u | UV) Hn = da? (u | u)Re. 


It is not difficult to see that 9” is homogeneous (see Exercise 1). One 
can also show that that the group Isom(§)")° is isomorphic to SO(1,n)° 
(see Exercise 4). 


A cartesian product of any combination of the above (see Exercise 6). 


(1.1.4) Definitions. Let ¢: X — X. 


1) 
2) 
3) 


We say that ¢ is involutive (or that ¢ is an involution) if ¢? = Id. 
A fixed point of ¢ is a point p € X, such that ¢(p) = p. 

A fixed point p of ¢ is tsolated if there is a neighborhood U of p, such 
that p is the only fixed point of ¢ that is contained in U. 


Besides an isometry taking x to y, each of the above spaces also has a 
nice involutive isometry that fixes x. 
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1) Define ¢1: S" > S” by 


o1(£1, soe Ona) = (—21, seey En, Badays 
Then ¢; is an isometry of S”, such that ¢, has only two fixed points: 
namely, €n41 and —€n41, where e€n41 = (0,0,...,0,1). Therefore, en+1 


is an isolated fixed point of ¢}. 

2) Define dg: R” > R” by ¢o(x) = —x. Then ¢2 is an isometry of R”, 
such that 0 is the only fixed point of ¢o. 

3) Define 43: 97 + ? by $3(z) = —1/z. Then i is the only fixed point 
of 3. 

4) There are involutive isometries of §)” that have a unique fixed point 
(see Exercise 3), but they are somewhat difficult to describe in the 
upper-half-space model that we are using. 


The existence of such an isometry is the additional condition that is required 
to be a symmetric space. 
(1.1.5) Definition. A Riemannian manifold X is a symmetric space if 
1) X is connected, 
2) X is homogeneous, and 
3) there is an involutive isometry ¢@ of X, such that @ has at least one 
isolated fixed point. 


(1.1.6) Remark. If X is a symmetric space, then all points of X are essentially 
the same, so, for each x € X (not only for some x € X), there is an isometry ¢ 
of X, such that ¢? = Id and z is an isolated fixed point of ¢ (see Exercise 9). 
Conversely, if Condition (3) is replaced with this stronger assumption, then 
Condition (2) can be omitted (see Exercise 10). 


We constructed examples of involutive isometries of S$”, R”, and 9” that 
have an isolated fixed point. The following proposition shows that no choice 
was involved: the involutive isometry with a given isolated fixed point p is 
unique, if it exists. Furthermore, in the exponential coordinates at p, the 
involution must simply be the map 7 > —2. 


(1.1.7) Proposition. Suppose ¢ is an involutive isometry of a Riemmanian 
manifold X, and suppose p is an isolated fixed point of d. Then 
1) dé, = —Id, and 
2) for every geodesic y with y(0) = p, we have ¢(y(t)) = y(-t), for all 
teER. 


Proof. (1) From the Chain Rule, and the fact that ¢(p) = p, we have 


d($7)p = dO 4(p) 0 dbp = (dd,)?. 
Also, because ¢” = Id, we know that d(¢?), = dId, = Id. We conclude that 
(dé,)* = Id; hence, the linear transformation d¢,: T,X — T,X satisfies the 
polynomial equation x? — 1 = 0. 
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Suppose dé, # —Id. (This will lead to a contradiction.) Since the 
polynomial x? — 1 has no repeated roots, we know that dd, is diagonalizable. 
Furthermore, because 1 and —1 are the only roots of x? — 1, we know that 1 
and —1 are the only possible eigenvalues of d¢p. Therefore, because d¢p # 
—Id, we conclude that 1 is an eigenvalue; so we may choose some nonzero 
v € T,X, such that dé,(v) = v. Let y be the geodesic with y(0) = p and 
y'(0) = v. Then, because ¢ is an isometry, we know that go 7¥ is also a 
geodesic. We have 


($0 7)(0) = (7(0)) = o() = p = 7(0) 

and 
(60 ¥)'(0) = dd.) (7'(0)) = dbp(v) = v = 7(0). 
Since every geodesic is uniquely determined by prescribing its initial position 
and its initial velocity, we conclude that 6o7 = y. Therefore, 6(7(t)) = A(t), 
so y(t) is a fixed point of ¢, for every t. This contradicts the fact that the 
fixed point p = 7(0) is isolated. 
(2) Define 7(t) = y(—t), so ¥ is a geodesic. Because ¢ is an isometry, we 

know that ¢07 is also a geodesic. We have 

($0 7)(0) = $(7(0)) = ) = p = 7(0) 
and, from (1), 

($ 07)'(0) = dby(o) (7'(0)) = -7'(0) = (0). 

Since a geodesic is uniquely determined by prescribing its initial position and 
its initial velocity, we conclude that 0 y = 7, as desired. LJ 


(1.1.8) Definition. Let M be a Riemannian manifold, and let p € M. It is 
a basic fact of differential geometry that there is a neighborhood V of 0 in 
T,M, such that the exponential map exp, maps V diffeomorphically onto a 
neighborhood U of p in M. By making V smaller, we may assume it is: 

e symmetric (that is, -V = V), and 

e star-shaped (that is, tV CV, for0 <t <1). 
The geodesic symmetry at p is the diffeomorphism 7 of U that is defined 
by 

T (exp, (v)) = €Xp,(—v), 

for allue V. 


In other words, for each geodesic y in M, such that 7(0) = p, and for all 
t € R, such that ty/(0) € V, we have 7(7(t)) = 7(—?). 


Note. The geodesic symmetry T is a local diffeomorphism, but, for most man- 
ifolds M, it is not a local isometry (cf. Remark 1.3.2). 


In this terminology, the preceding proposition shows that if an involutive 
isometry @ has a certain point p as an isolated fixed point, then, locally, @ 
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must agree with the geodesic symmetry at p. This has the following easy 
consequence, which is the motivation for the term symmetric space. 


(1.1.9) Corollary. A connected Riemannian manifold M is a symmetric space 
if and only if, for each p € M, the geodesic symmetry at p extends to an 
isometry of M. 


Exercises for §1.1. 


#1. 


49. 


#3. 


4A, 


Show that §)” is homogeneous. 


[Hint: For any t € R*, the dilation x ++ tx is an isometry of §”. Also, for 
any v € R”~', the translation x +> x + v is an isometry of 9”.] 


Let B® = {x € R” | ||z|| < 1} be the open unit ball in R”, equip T;,B” 
with the inner product 
1 
(u |v) Bm = ———— (u | v) Re, 
(1 — |lx[I?) 
and let e, = (0,0,...,0,1) € R”. Show that the map ¢: B" > ” 
defined by 


L+En, 1 
i Cn 
WA) = ere I 


is an isometry from B, onto §)”. (In geometric terms, ¢ is obtained by 
composing a translation with the inversion centered at the south pole 
of B”.) 


Show that x +> —z is an isometry of B” (with respect to the Riemann- 
ian metric (- | -)g» defined in Exercise 2). 


For u,v € Rt, define 


n 
(u | v)in = Uovo — ye: 
j=l 


(Note that, for convenience, we start our numbering of the coordinates 
at 0, rather than at 1.) Let 


Xf, ={xeE cr | (2 |e) =1, ro. 0}, 
so X i » is one sheet of a 2-sheeted hyperboloid. Equip T,X i , with the 


inner product obtained by restricting (- | -)1,, to this subspace. 
a) Show that the bijection 7: B" > Xt defined by 


1 
W(x) = T— Jel? (1, z) 


is an isometry. (Note that this implies that the restriction of (- | 
‘)in to T,X, is positive definite, even though (- | -)i., is not 


1,n 


positive definite on all of R"*?.) 
b) Show SO(1,n)° acts transitively on X}',, by isometries. 
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#5. For G = SO(1,n)° = Isom(S9”)°, show there is some p € $9”, such that 
Stabe(p) = SO(n). 
[Hint: This is easy in the hyperboloid model X is pa 


#6. Show that if X,, Xo,...,X, are homogeneous spaces, then the cartesian 
product X, x X2g x--- x X,y is also homogeneous. 


#7. Show that every homogeneous space is geodesically complete. That 
is, for every geodesic segment y: (—e,€) + X, there is a doubly-infinite 
geodesic 7: R > X, such that ¥(t) = y(t) for all t € (—e,€). 


#8. Show that if X1,...,X, are symmetric spaces, then the cartesian prod- 
uct X1 x Xg x--- xX X» is also a symmetric space. 


#9. Show that if X is a symmetric space, then, for each x € X, there is 
an isometry ¢ of X, such that ¢? = Id and z is an isolated fixed point 


of ¢. 


#10. Let X be a connected Riemannian manifold, and assume, for each x € 
X, that there is an isometry ¢ of X, such that ¢? = Id and z is an 
isolated fixed point of ¢. Show that X is homogenous, and conclude 
that X is a symmetric space. 


#11. Show that the real projective space RP” (with the metric that makes 
its universal cover a round sphere) has an involutive isometry ¢, such 
that ¢ has both an isolated fixed point, and a fixed point that is not 
isolated. Is RP” a symmetric space? 


§1.2. How to construct a symmetric space 


In this section, we describe how Lie groups are used to construct symmetric 
spaces. Let us begin by recalling the well-known group-theoretic structure of 
any homogeneous space. 

Suppose X is a connected homogeneous space, and let G = Isom(X)?. 
Because Isom(X) is transitive on X, and X is connected, we see that G is 
transitive on X (see Exercise 1), so we may identify X with the coset space 
G/K, where K is the stabilizer of some point in X. Note that K is compact 
(see Exercise 2). 

Conversely, if K is any compact subgroup of any Lie group G, then there 
is a G-invariant Riemannian metric on G/K (see Exercise 4), so G/K (with 
this metric) is a homogeneous space. (For any closed subgroup H of G, 
the group G acts transitively on the manifold G/H, by diffeomorphisms. 
However, when H is not compact, G usually does not act by isometries of 
any Riemannian metric on G'/H, so there is no reason to expect G/H to be 
a homogeneous space in the sense of Definition 1.1.1.) 
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(1.2.1) Example. 
1) For X = S”, we have G = SO(n + 1), and we may let 
K =Stabe(en+1) = SO(n), so S” =SO(n+1)/SO(n). 
Note that, letting o be the diagonal matrix 
go = diag(—1,—-1,...,—1,1), 
we have 0? = Id, and K = Cg(c) is the centralizer of o in G. 
2) For X = R”, we have G = SO(n) x R”, and we may let 
K = Stabg(0) = SO(n), so R” = (SO(n) x R”)/SO(n). 


Note that the map o: (k,v) +> (k, —v) is an automorphism of G, such 
that o? = Id, and 


Cale) ={9€ Glog) =g} =K. 
3) For X = 9”, we have G ~ SL(2,R), and we may let 
K =Staba(i) + SO(2), so 9? = SL(2,R)/SO(2). 

4) For X = §)", we have G = SO(1,n)°, and we may take K = SO(n) 
(see Exercise 1.145). Note that, for o = diag(1,—1,—1,...,—1), we 
have 0? = Id, and K = Cg(c). 

Therefore, in each of these cases, there is an automorphism o of G, such that 
K is the centralizer of 0. (In other words, K = {k € G| o(k) =k} is the set 


of fixed points of o in G.) The following proposition shows, in general, that 
a slightly weaker condition makes G/K symmetric, not just homogeneous. 


(1.2.2) Proposition. Let 
e G be a connected Lie group, 
e K be a compact subgroup of G, and 


e o be an involutive automorphism of G, such that K is an open subgroup 
of Ca(c). 
Then G/K can be given the structure of a symmetric space, such that the map 
T(gK) = 0(g)K is an involutive isometry of G/K with eK as an isolated fixed 
point. 


Proof. To simplify the proof slightly, let us assume that K = Cg(c) (see Ex- 
ercise 5). 

Because K is compact, we know there is a G-invariant Riemmanian metric 
on G/K (see Exercise 4). Then, because (7) is finite, and normalizes G, 
it is not difficult to see that we may assume this metric is also T-invariant 
(see Exercise 6). (This conclusion can also be reached by letting Gt = (0) KG 
and Kt = (a) x K, so K* is a compact subgroup of Gt, such that GT/KT = 
G/K.) Therefore, 7 is an involutive isometry of G/K. 

Suppose gK is a fixed point of 7, with g ~ e. Then o(g) € gK, so we 
may write o(g) = gk, for some k € K. Since o centralizes k (and o is an 
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automorphism), we have 
o°(g) = o(0(9)) = o(gk) = o(g) o(k) = (gk) (k) = gk. 

On the other hand, we know o7(g) = g (because o is involutive), so we 
conclude that k? = e. 

Since g © e, and o(e) =e, we have o(g) © g, sok=g ‘o(g) Se. Since 
k? = e, we conclude that k = e. (There is a neighborhood U of e in G, such 
that, for every u € U \ {e}, we have u? # e.) Therefore o(g) = gk = ge = g, 
so g € Ca(c) = K; hence, gk = eK. O 


1 


Conversely, for any symmetric space X, there exist G, K, and o as in 
Proposition 1.2.2, such that X is isometric to G/K (see Exercise 7). 


(1.2.3) Example. Let G = SL(n,R), K = SO(n), and define o(g) = (g~+)* 
(the transpose-inverse). Then a? = 1 and Cg(c) = K, so the theorem implies 
that G/K is a symmetric space. Let us describe this space somewhat more 
concretely. 

Recall that any real symmetric matrix A can be diagonalized over R. In 
particular, all of its eigenvalues are real. If all the eigenvalues of A are strictly 
positive, then we say that A is positive definite. 

Let 


X ={Ae€SL(n,R) | A is symmetric and positive definite }, 
and define a: G x X + X by a(g,x) = gag’. Then: 

a) a defines an action of G on X; ie., we have a(gh,x) = a(g,a(h,2)) 
for allg,hE GandxeEeX. 

b) This action is transitive, and we have kK = Stabg(Id), so X may be 
identified with G/K. 

c) TigX = {u € Matny.»(R) | u is symmetric and trace(u) = 0}. (By def- 
inition, we have X C SL(n,R). The condition trace(w) = 0 is obtained 
by differentiating the restriction det(A) = 1.) 

d) The inner product (u | v) = trace(wv) on TjqX is K-invariant, so it 
may be extended to a G-invariant Riemannian metric on X. 


e) The map rt: X > X, defined by 7(A) = A™!, is an involutive isometry 
of X, such that r(a(g,x)) = o(g) T(z) for allg € Gand ze X. 
(1.2.4) Example. Other examples of symmetric spaces are: 
1) SL(n, C)/SU(n), and 
2) SO(p, 4)°/(SO(p) x SO(q)). 


These are special cases of a consequence of Proposition 1.2.2 that will be 
stated after we introduce some terminology. 
(1.2.5) Definitions. 


1) A symmetric space X is irreducible if its universal cover is not iso- 
metric to any nontrivial product X; x Xo. 
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2) A Riemannian manifold is flat if its curvature tensor is identically zero, 
or, equivalently, if every point in X has a neighborhood that is isometric 
to an open subset of the Euclidean space R”. 


(1.2.6) Proposition. Let G be a connected, noncompact, simple Lie group 
with finite center. Then G has a maximal compact subgroup K (which is 
unique up to conjugacy), and G/K is a simply connected, noncompact, irre- 
ducible symmetric space. Furthermore, G/K has non-positive sectional cur- 
vature and is not flat. 

Conversely, any noncompact, non-flat, irreducible symmetric space is of 
the form G/K, where G is a connected, noncompact, simple Lie group with 
trivial center, and K is a maximal compact subgroup of G. 


(1.2.7) Remark. Let K be a compact subgroup of a connected, simple Lie 
group G with finite center, such that G/K is a symmetric space (cf. Propo- 
sition 1.2.2). Proposition 1.2.6 shows that if G is not compact, then kK must 
be a maximal compact subgroup of G, which is essentially unique. 

On the other hand, if G is compact, then the subgroup K may not be 
unique, and may not be maximal. For example, both SO(n)/SO(n — 1) and 
SO(n)/{e} are symmetric spaces. The former is a round sphere, which has 
already been mentioned. The latter is a special case of the fact that every 
connected, compact Lie group is a symmetric space (see Exercise 10). 

E. Cartan obtained a complete list of all the symmetric spaces (both com- 
pact and noncompact) by finding all of the simple Lie groups G (see Theo- 
rem A2.7), and determining, for each of them, which compact subgroups K 
can arise in Proposition 1.2.2. 


Exercises for §1.2. 


#1. Suppose a topological group G acts transitively (and continuously) on 
a connected topological space M. Show that the identity component 
G° is transitive on M. 


#2. Let {gn} be a sequence of isometries of a connected, complete Riemann- 
ian manifold WM, and assume there exists p € M, such that gn,p = p 
for all n.) Show there is a subsequence {gn,} of {gn} that converges 
uniformly on compact subsets of MW. (That is, there is some isometry g 
of M, such that, for every « > 0 and every compact subset C' of M, 
there exists ko, such that d(gn,c, gc) < € for all c € C and all k > ko.) 
[Hint: This is a special case of the Arzel-Ascoli Theorem. For each c € C, 
the sequence {gnc} is bounded, and therefore has a convergent subsequence. 


By Cantor diagonalization, there is a subsequence that works for all c in a 
countable, dense subset of C.] 


#3. Let K be a compact group, and let p: K — GL(n,R) be a continuous 
homomorphism. Show that there is a K-invariant inner product (- | -) K 
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$5. 


#6. 


#7. 


#8. 


#9. 


410. 
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on R”; that is, such that (p(k)u | p(k)u) re = (u | Uy foralke Kk 
and all u,v € R”. 


[Hint: Define (u| v)« = fi, (p(k)u | p(k)v) du(k), where jz is Haar measure 
on K.] 


Let K be a compact subgroup of a Lie group G. Use Exercise 3 to show 
that there is a G-invariant Riemannian metric on G/K. 


[Hint: A G-invariant Riemannian metric on G/K is determined by the inner 
product it assigns to the tangent space T.«(G/K).| 


Complete the proof of Proposition 1.2.2, by removing the simplifying 
assumption that K = Cg(o). 


Let F' be a finite group of diffeomorphisms (not necessarily isometries) 
of a Riemannian manifold (M,(- | -)z). Define a new inner product 
(-|-)%, on each tangent space TM by 


(u| vy, = S>(dfe(u) | dfe(v)) pce): 
fer 
a) Show that the Riemannian metric (- | -)’ on M is F-invariant. 
b) Show that if G is a group of isometries of (M,(- | -)z), and G is 
normalized by F’, then (- | -)’ is G-invariant. 


For any symmetric space X, show that there exist G, K, and o as in 
Proposition 1.2.2, such that X is isometric to G/K. 

[Hint: Suppose 7 is an involutive isometry of X with an isolated fixed point p. 
Let G = Isom(X)° and K = Stabg(p). Define o(g) = Tgr. Show K C Ca(a) 
and, using the fact that p is isolated, show that K contains the identity 
component of Cc(c).] 


Verify assertions (a), (b), (c), (d), and (e) of Example 1.2.3. 


[Hint: To prove transitivity in (b), you may assume that every symmetric 
matrix A is diagonalizable by an orthogonal matrix. That is, there exists g, 
such that gAg~' is diagonal and gg’ = Id. Note that every positive-definite 
diagonal matrix has a square root that is also a diagonal matrix.] 


Show that if X is a connected homogeneous space, then Isom(X) has 
only finitely many connected components. 

[Hint: Every component of G = Isom(X) intersects the compact group 
Stab, (x).] 


Show that if G is compact, then there is a G-invariant Riemannian 
metric on G that makes G a symmetric space. 


[Hint: The involutive isometry is g > g~'.] 
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§1.3. Locally symmetric spaces 


The gist of the following definition is that a locally symmetric space is a 
Riemannian manifold that is locally isometric to a symmetric space; that is, 
every point has a neighborhood that is isometric to an open subset of some 
symmetric space. 


(1.3.1) Definition. A complete Riemannian manifold M is locally sym- 
metric if its universal cover is a symmetric space. In other words, there is a 
symmetric space X, and a group [ of isometries of X, such that 

1) T acts freely and properly discontinuously on X, and 

2) M is isometric to [\X. 


(1.3.2) Remark. At every point of a symmetric space, the geodesic symme- 
try y(t) — y(—t) extends to an isometry of the entire manifold (see Corol- 
lary 1.1.9). In a locally symmetric space, the geodesic symmetry 7 at each 
point is an isometry on its domain, but it may not be possible to extend 7 to 
an isometry that is well-defined on the entire manifold; that is, the geodesic 
symmetry is only a local isometry. That is the origin of the term locally 
symmetric. 


(1.3.3) Example. Define g: 9? — §? by g(z) = z+ 1, let T = (g), and let 
M =T\S?. Then (obviously) M is locally symmetric. 

However, M is not symmetric. We provide several different geometric 
proofs of this fact, in order to illustrate the important distinction between 
symmetric spaces and locally symmetric spaces. (It can also be proved group- 
theoretically (see Exercise 2).) The manifold MW is a cusp: 


1) Any point far out in the cusp lies on a short loop that is not null- 
homotopic, but points at the other end do not lie on such a loop. 
Therefore, M is not homogeneous, so it cannot be symmetric. 


2) The geodesic symmetry performs a 180° rotation. Therefore, if it is a 
well-defined diffeomorphism of M, it must interchange the two ends of 
the cusp. However, one end is thin, and the other end is (very!) wide, 
so no isometry can interchange these two ends. Hence, the geodesic 
symmetry (at any point) is not an isometry, so M is not symmetric. 
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3) Let us show, directly, that the geodesic symmetry at some point p € §)? 
does not factor through to a well-defined map on ['\)? = M. 
e Let x = —1+72 and y = 1+, and let p € iR be the midpoint of 
the geodesic segment joining x and y: 


e Let 7 be the geodesic symmetry at p. Then r(x) = y= 1+. 
e Because the imaginary axis is a geodesic, we have r(i) = ai, for 


some a > 1. 
e Nowi=x2+1=4gQ/(x), so x and i represent the same point in M. 
However, 7(i) — T(x) = —1 + (a — 1) is not an integer (it is not 


even real), so T(x) and T(z) do not represent the same point in M. 
Therefore, 7 does not factor through to a well-defined map on M. 


(1.3.4) Remarks. 


1) Some authors do not require M to be complete in their definition of a 
locally symmetric space. This would allow the universal cover of M to 
be an open subset of a symmetric space, instead of the entire symmetric 
space. 

2) A more intrinsic (but more advanced) definition is that a complete, 
connected Riemannian manifold M is locally symmetric if and only 
if the curvature tensor of M is invariant under all parallel translations, 
and M is complete. 


Any complete, connected manifold of constant negative curvature is a 
locally symmetric space, because the universal cover of such a manifold is §)” 
(after normalizing the curvature to be —1). As a generalization of this, we 
are interested in locally symmetric spaces M whose universal cover M is of 
noncompact type, with no flat factors; that is, such that each irreducible 
factor of M is noncompact (and not flat). From Proposition 1.2.6, we see, 
in this case, that M can be written in the form M = G/K, where G = 
G, x --- x G, is a product of noncompact simple Lie groups, and K is a 
maximal compact subgroup of G. We have M = T\M , for some discrete 
subgroup [° of Isom(M iF We know that Isom(M ) has only finitely many 
connected components (see Exercise 1.2#9), so, if we replace M with an 
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Figure 1.3A. A fundamental domain F for SL(2,Z) in SL(2,R). 


appropriate finite cover, we can arrange that [TC Isom ( M ‘is = G. Then 
M =T\G/K, and I is a discrete subgroup of G. 

A topologist may like M to be compact, but it turns out that a very 
interesting theory is obtained by making the weaker assumption that M/ has 
finite volume. Hence, the subgroup [' should be chosen so that [\G/K has 
finite volume. Because ['\G is a principal K-bundle over [\G/K, and K has 
finite measure, it is not difficult to see, from Fubini’s Theorem, that [\G has 
finite volume (see Exercise 6). This leads to the following definition. 


(1.3.5) Definition. A subgroup I of G is a lattice in G if [ is discrete and 
I'\G has finite volume (which respect to the Haar measure on G). 


(1.3.6) Example. If T is discrete and [\G is compact, then T° is a lattice 
in G, because any compact Riemannian manifold obviously has finite volume. 


(1.3.7) Example. SL(2, Z) is a lattice in SL(2,R). To see this, let 
F={z€§ | |z|>1 and -1/2 < Rez < 1/2} (1.3.8) 
(see Figure 1.3A). It is well known (though not obvious) that F is a funda- 
mental domain for the action of SL(2,Z) on §? (see Exercises 7 and 8); it 
therefore suffices to show that F has finite volume, or, more precisely, finite 
hyperbolic area. 
The hyperbolic area dA of an infinitesimal rectangle is the product of its 
hyperbolic length and its hyperbolic width. If the Euclidean length is dx and 
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the Euclidean width is dy, and the rectangle is located at the point x + zy, 
then, by definition of the hyperbolic metric, the hyperbolic length is (dz) /(2y) 
and the hyperbolic width is (dy) /(2y). Therefore, 

dx dy 

Ay2 ° 

Since Imz > /3/2 for all z € F, we have 


We . Gy TG. a 
vol(F) =) as fo / aot a —, dy < 00. 
etiyeF V3/2 J—-1/2 V3/2 y? 


Unfortunately, SL(2,Z)\? is not a at symmetric space, because 
SL(2,Z) does not act freely on $7 (so the quotient space is not a Riemannian 
manifold). However, there are finite-index subgroups of SL(2,Z) that do act 
freely (cf. Theorem 4.8.2), and these provide interesting locally symmetric 
spaces. 


dA = 


Calculations similar to (but more complicated than) Example 1.3.7 show: 

e SL(n, Z) is a lattice in SL(n,R), and 

e SO(p, q) NSL(n, Z) is a lattice in SO(p, q). 
As in the example of SL(2,Z)\§?, the hard part is to find a fundamental 
domain for [\G (or an appropriate approximation of a fundamental domain); 
then it is not difficult to see that its volume is finite. These are special cases 


of the following general theorem, which implies that every simple Lie group 
has a lattice. 


(1.3.9) Theorem (Arithmetic subgroups are lattices (see Theorem 5.1.11)). 
Assume 


e G=G, X-:::X Gm is a product of simple Lie groups, 
e GCSL(Z,R), and 
e GNSL(¢, Q) is dense in G. 

Then Gz =GNSL(¢,Z) is a lattice in G. 


Lattices constructed by taking the integer points of G' in this way are said 
to be arithmetic (see Definition 5.1.19). (For most simple Lie groups, these 
are the only lattices (see Theorem 5.2.1).) When @ is large, there is more than 
one way to embed G in SL(@,R), and we will see that different embeddings 
can lead to quite different intersections with SL(¢, Z). In particular, if G is a 
noncompact, simple Lie group, then: 


e By taking an appropriate embedding of G in some SL(¢,R), we will 
construct a lattice [ in G, such that [\G is not compact (see Corol- 
lary 5.1.17). 


e By taking a different embedding, we will construct a different lattice 
I’, such that I’\G is compact (see Theorem 18.7.1). 
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We will also see that algebraic properties of I influence the geometry of 
the corresponding locally symmetric space M. In particular, the structure 
of I determines whether M is compact or not. (For example, the “Godement 
Criterion” (5.3.3) implies that M is compact if and only if every element 
of T is a diagonalizable matrix over C.) Much more generally, the following 
important theorem implies that every geometric property of M is faithfully 
reflected in some group-theoretic property of I. 


(1.3.10) Theorem (Mostow Rigidity Theorem (see Chapter 15)). Let Mi and 
My be finite-volume locally symmetric spaces (not both 2-dimensional), such 
that 


e the universal covers of My, and Mz are neither compact, nor flat, nor 
reducible, and 


e the volumes of M, and Mz are normalized (i.e., vol My = vol Mz = 1). 


If ™1(M1) = 771(Mz2), then My is isometric to M2. 
In fact, every homotopy equivalence is homotopic to an isometry. 


The theorem implies that locally symmetric spaces have no nontrivial 
deformations, which is why it is called a “rigidity” theorem: 


(1.3.11) Corollary. Let {g;} be a continuous family of Riemannian metrics 
on a manifold M with dim M > 2, such that, for each t: 


e (M,q) is a finite-volume locally symmetric space whose universal cover 
is neither compact, nor flat, nor reducible, and 


e vol(M,g,) = 1. 
Then (M, gz) is isometric to (M, go), for every t. 


(1.3.12) Definition. A locally symmetric space is irreducible if no finite 
cover of M can be written as a nontrivial cartesian product M, x Mo. 


It is important to note that the universal cover of an irreducible locally 
symmetric space need not be an irreducible symmetric space. In other words, 
there can be lattices in G, x --- x G, that are not of the form T; x --- x Ty, 
(see Example 5.5.3). 


(1.3.13) Remark. Theorem 1.3.10 (and the corollary) can be generalized to 
the case where only Mj, rather than the universal cover of Mj, is irreducible. 
However, this requires the hypotheses to be strengthened: it suffices to as- 
sume that no irreducible factor of M1 or Mo is either compact or flat or 
2-dimensional. Furthermore, the conclusion needs to be weakened: rather 
than simply multiplying by a single scalar to normalize the volume, there can 
be a different scalar on each irreducible factor of the universal cover. 
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Exercises for 81.3. 


#1. 


4.2. 


#3. 


4A, 


#69. 


#6. 


#7. 


#8. 


Let 
e X be asimply connected symmetric space, 
e T'\X be a locally symmetric space whose universal cover is X (so 
T is a discrete group of isometries that acts freely and properly 
discontinuously on X), and 
e 7 be an isometry of X. 
Show that if 7 factors through to a well-defined map on ['\X, then + 
normalizes [' (that is, ry7~! €T, for every y €T). 


Define g: 9? > 8? by g(z) = 2z4+1. 
a) Show the geodesic symmetry T at 7 is given by T(z) = —1/z. 
b) Show that 7 does not normalize (g). 
c) Conclude that 7 does not factor through to a well-defined map on 
(9)\9°. 
Let 


e X be asimply connected symmetric space, and 
e T'\X bea locally symmetric space whose universal cover is X (so 
I is a discrete group of isometries that acts freely and properly 
discontinuously on X). 
Show that X is homogeneous if and only if the normalizer NG(T) is 
transitive on X, where G = Isom(X). 


Let M =T\G/K bea locally symmetric space, and assume that G has 
no compact factors. Show that if NG(T)/T is finite, then Isom(M) is 
finite. 


Show that if K is any compact subgroup of a Lie group G, then there 
is a unique (up to a scalar multiple) G-invariant Borel measure v on 
G/K, such that v(C) < oo, for every compact subset C of G/K. 


Let 
e K be acompact subgroup of a Lie group G, and 
e I be a discrete subgroup of G that acts freely on G/K. 
Show that [\G has finite volume if and only if [\G/K has finite volume. 


Let T = SL(2,Z), and define F C $j? as in (1.3.8). Show, for each 
p € $9, that there is some y € T' with y(p) € F. 
[Hint: If Imy(p) < Imp for all y ET, and —1/2 < Rep < 1/2, then p € F |] 


Let [ = SL(2,Z), and define F c §)? as in (1.3.8). Show, for z,w € Ff, 
that if there exists y € T with y(z) = w, then either z = w or z,w € OF. 
[Hint: Assume Imw < z. Then |72,12 + 72,2| < 1. Hence |y2,1| € {0,1}. 
If |y21| = 1 and 42,2 4 0, then |Rez| = 1/2, so z € OF. If |yoi1| = 1 
and 72,2 = 0, then w = (az — 1)/z. Since |Re(1/z)| < |Rez| < 1/2, and 
|Rew| < 1/2, we see that either Rez = 1/2 or w = —-1/z.] 
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Notes 


Either of Helgason’s books [2, 3] is a good reference for the geometric ma- 
terial on symmetric spaces and locally symmetric spaces, the connection with 
simple Lie groups, and much more. Lattices are the main topic of Raghu- 
nathan’s book [8]. 

Theorem 1.3.9 is a result of Borel and Harish-Chandra [1] that will be 
proved in Chapters 7 and 19. 

Theorem 1.3.10 combines work of Mostow [5], Prasad [7], and Margulis 
[4]. We will discuss it in Chapter 15. 

Example 1.3.7 appears in many number theory texts, including [9, §7.1.2, 
pp. 77-79]. Our hints for Exercises 1.37 and 1.3#8 are taken from [6, 
Prop. 4.4, pp. 181-182]. 
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Chapter 2 


Geometric Meaning of 
IR-rank and Q-rank 


This chapter, like the previous one, is motivational. It is not a prerequisite 
for later chapters. 


§2.1. Rank and real rank 


Let X be a symmetric space (see Definition 1.1.5). For example, X could be 
a Euclidean space R”, or a round sphere $8”, or a hyperbolic space §)”, or a 
product of any combination of these. 

As is well known, the rank of X is a natural number that describes part 
of the geometry of X, namely, the dimension of a maximal flat. 


(2.1.1) Definition. A flat in X is a connected, totally geodesic, flat sub- 
manifold of X. 


(2.1.2) Definition. rank X is the largest natural number r, such that X 
contains an r-dimensional flat. 


Let us assume that X has no flat factors. (That is, the universal cover 
of X is not isometric to a product of the form Y x R”. Mostly, we will be 
interested in the case where X also does not have any compact factors.) 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
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Ma ly symmetric spaces (Chapter 1) 


and other differential geometry. 
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Let G = Isom(X)°. Then G acts transitively on X, and there is a compact 
subgroup K of G, such that X = G/K. Because X has no flat factors, G 
is a connected, semisimple, real Lie group with trivial center (see §1.2). (We 
remark that G is isomorphic to a closed subgroup of SL(¢,R), for some 2.) 

The real rank can be understood similarly. It is an invariant of G that 
is defined algebraically (see Chapter 8), but it has the following geometric 
interpretation. 


(2.1.3) Theorem. rankp G is the largest natural number r, such that X con- 
tains a closed, simply connected, r-dimensional flat. 


(2.1.4) Warning. By closed, we simply mean that the flat contains all of its 
accumulation points, not that it is compact. (A closed, simply connected flat 
is homeomorphic to some Euclidean space R”.) 


For example, if X is compact, then every closed, totally geodesic, flat 
subspace of X must be a torus, not R”, so rankg G = 0. On the other hand, 
if X is not compact, then X has unbounded geodesics (for example, if X is 
irreducible, then every geodesic goes to infinity), so rankgG > 1. Hence: 

rankep G = 0 = X is compact. 
Thus, there is a huge difference between rankgpG = 0 and rankpG > 0, 
because no one would mistake a compact space for a noncompact one. 


(2.1.5) Remark. rankg G = rank X if and only if X has no compact factors. 


There is also an important difference between rankp G = 1 and rankp G > 
1. The following proposition is an important example of this. 


(2.1.6) Definition. X is two-point homogeneous if, whenever (71, 72) and 
(y1,Y2) are two pairs of points in X with d(x1,22) = d(y1, ye), there is an 
isometry g of X with g(x1) = y, and g(x2) = yo. 


If rankp G > 1, then there exist maximal flats H,; and H» that intersect 
nontrivially. On the other hand, there also exist some pairs 71,72, such that 
{@1,£2} is not contained in the intersection of any two (distinct) maximal 
flats. This establishes one direction of the following result. 


(2.1.7) Proposition. Assume X is noncompact and irreducible. The sym- 
metric space X is two-point homogeneous if and only if rankpG = 1. 


The following is an infinitesimal version of this result. 


(2.1.8) Proposition. Assume X is noncompact and irreducible. The action 
of G on the set of unit tangent vectors of X is transitive iff ranky G = 1. 


(2.1.9) Corollary. rankg SO(1,n) = 1. 


Proof. For G = SO(1,n), we have X = 9". The stabilizer SO(n) of a point 
in §)” acts transitively on the unit tangent vectors at that point. So G acts 
transitively on the unit tangent vectors of X. LJ 


2.2. Q-RANK 23 


More generally, it can be shown that rankg(SO(m,n)) = min{m,n}. 


Also, rankp (SL(n, R)) =n-—1. Although they may not be obvious geometri- 
cally, these real ranks are easy to calculate from the algebraic definition that 
will be given in Chapter 8. 


(2.1.10) Remark. For every r, there is a difference between rankg G = r and 
rankepG > r, but this difference is less important as r grows larger: the 
three main cases are rankyG = 0, rankgG = 1, and rankgG > 2. (This is 
analogous to the situation with smoothness assumptions: countless theorems 
require a function to be C® or C! or C?, but far fewer theorems require a 
function to be, say, C’, rather than only C°.) 


Exercises for §2.1. 
#1. Show ranky(G, x G2) = ranky G; + ranke Go. 


#2. Assume rankye G = 1. Show X is irreducible if and only if X has no 
compact factors. 


#3. Show that if X is reducible, then X is not two-point homogeneous. (Do 
not assume the fact about maximal flats that was mentioned, without 
proof, before Proposition 2.1.7.) 


§2.2. Q-rank 


Now let ['\X be a locally symmetric space modeled on X, and assume that 
I'\X has finite volume. Hence, [ is a (torsion-free) discrete subgroup of G, 
such that I'\G has finite volume; in short, [ is a lattice in G. 

The real rank depends only on X, so it is not affected by the choice 
of a particular lattice [. We now describe an analogous algebraically defined 
invariant, rankg I’, that does depend on I’, and therefore distinguishes between 
some of the various locally homogeneous spaces that are modeled on X. We 
will mention some of the geometric implications of Q-rank, leaving a more 
detailed discussion to later chapters. 


(2.2.1) Theorem (see Subsections 19.3(iii) and 19.3(iv)). 


1) rankgI is the largest natural number r, such that some finite cover of 
T'\X contains a closed, simply connected, r-dimensional flat. 


2) rankgT is the smallest natural number r, for which there exists collec- 
tion of finitely many closed, r-dimensional flats, such that all of T\X 
is within a bounded distance of the union of these flats. 


(2.2.2) Remark. It is clear from Theorem 2.2.1(1) that rankg IT always exists 
(and is finite). Furthermore, 0 < rankgIl < rankgG. Although not so 
obvious, it can be shown that the extreme values are always attained: there 
are lattices [, and [, in G with rankgI. = 0 and rankgI’, = rankpG 
(see Theorem 18.7.1 and Exercise 9.147). So it is perhaps surprising that 
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there may be gaps in between. (For example, if G = SO(2,n), with n > 5, 
and n is odd, then rankg G = 2, but Corollary 18.6.2 shows there does not 
exist a lattice [ in G, such that rankgT = 1.) 


(2.2.3) Example (see Example 9.1.5). From the algebraic definition, which 
will appear in Chapter 9, it is easy to calculate 
rankg(SO(m, n)z) = min{m,n} = rankg (SO(m, n)) 


and 


rankg(SL(n, Z)) =n —1=rankg(SL(n,R)). 


As for the real rank, the biggest difference is between spaces where the 
invariant is zero and those where it is nonzero, because this is again the 
distinction between a compact space and a noncompact one: 


2.2.4) Theorem (see Exercise 9.1445). rankg IT = 0 iff [\X 1s compact. 
Q 


Theorem 2.2.1(2) implies that the Q-rank of I is directly reflected in the 
large-scale geometry of [\X, as described by the asymptotic cone of ['\X. 
Intuitively, the asymptotic cone of a metric space is obtained by looking at 
it from a large distance. For example, if [\X is compact, then, as we move 
farther away, the manifold appears smaller and smaller (see the illustration 
below). In the limit, the manifold shrinks to a point. 


(SS) ee (2.2.5) 


An intuitive understanding is entirely sufficient for our purposes here, 
but, for the interested reader, we provide a more formal definition. 


(2.2.6) Definition. The asymptotic cone of a metric space (M,d) is the 
limit space 
lim, ((M, ed), p), 


E> 
if the limit exists. Here, p is an arbitrary (but fixed!) point of M, and the 
limit is with respect to Gromov’s Hausdorff distance. (Roughly speaking, a 
large ball around p in (M, ed) is 6-close to being isometric to a large ball 
around a certain (fixed) point po in the limit space (Mo, do).) 


(2.2.7) Examples. 


1) If [\X is compact, then the asymptotic cone of I'\X is a point, as is 
illustrated in (2.2.5). This point is a 0-dimensional simplicial complex, 
which is a geometric manifestation of the fact that rankg I = 0. 

2) If rankg G = 1, and ['\X is not compact, then, as is well known, [\X 
has finitely many cusps. The asymptotic cone of a cusp is a ray, so 
the asymptotic cone of [\X is a “star” of finitely many rays emanating 
from a single vertex (see Figure 2.2A). Therefore, the asymptotic cone 
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of I'\X is a 1-dimensional simplicial complex. This manifests the fact 
that rankgT = 1. 


Figure 2.2A. Looking at a manifold with cusps from farther 
and farther away. 


(2.2.8) Theorem (see Remark 19.3.9). The asymptotic cone of T\X is a 
simplicial complex whose dimension is rankg I. 


(2.2.9) Example. Let G = SL(3, R) and [ = SL(3, Z). From Theorem 2.2.8, 
we see that the asymptotic cone of [\G/K is a 2-dimensional simplicial com- 
plex. In fact, it turns out to be (isometric to) the sector 


{ a) eR? nyc el 


2 
(It is not a coincidence that this sector is a Weyl chamber of the Lie algebra 
$l(3, R).) 


(2.2.10) Remarks. 


1) If rankgT = 1, then the asymptotic cone of I'\X is a star of finitely 
many rays emanating from the origin (cf. Example 2.2.7(2)). Note that 
this intersects the unit sphere in finitely many points. 


2) In general, if rankgI’ = k, then the unit sphere contains a certain 
simplicial complex 7p of dimension k — 1, such that the asymptotic 
cone of I'\X is the union of all the rays emanating from the origin that 
pass through 7p. 

3) For T = SL(3,Z), the simplicial complex 7p is a single edge (cf. Ex- 
ample 2.2.9). In general, the Tits building 7G is a certain simplicial 
complex defined from the parabolic Q-subgroups of G, and 7p can be 
obtained from 7g by modding out the action of I. 


4) The asymptotic cone is also known as “tangent cone at infinity” 


(2.2.11) Remark. Although we will not prove this, the Q-rank is directly 
reflected in the cohomology of T\X. Namely, let c be the cohomological 
dimension of '\\X. Because ['\X is a manifold of dimension dim X, we have 
c= dim X if and only if [\X is compact. So the deficiency dim X — c is, in 
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some sense, a measure of how far I'\X is from being compact. This measure 
is precisely rankgI (if X has no compact factors). 


Exercises for §2.2. 


#1. Theorem 2.2.4 states that if [\X is compact, then rankgT = 0. 
a) Prove this directly from Theorem 2.2.1(1). 
b) Prove this directly from Theorem 2.2.1(2). 


Notes 


Helgason’s book [4] has a thorough treatment of rank and R-rank. 

Theorem 2.2.1(2) was proved by B. Weiss [8]. 

Theorem 2.2.4 was proved for arithmetic lattices by Borel and Harish- 
Chandra [1] and, independently, by Mostow and Tamagawa [7]. For non- 
arithmetic lattices, this theorem is part of the definition of Q-rank. 

A more precise version of Theorem 2.2.8 (providing a description of the 
geometry of the simplicial complex) was proved by Hattori [3]. Proofs also 
appear in [5] and [6]. 

Remark 2.2.11 is due to Borel and Serre [2]. 
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Chapter 3 


Brief Summary 


This book is about arithmetic subgroups, and other lattices, in semisimple Lie 
groups. Given a lattice I in a semisimple Lie group G, we will investigate both 
the algebraic structure of I’, and properties of the corresponding homogeneous 
space G/T’. We will also study the close relationship between G and [. For 
example, we will see that G is essentially the only semisimple group in which I 
can be embedded as a lattice (“Mostow Rigidity Theorem” ), and, conversely, 
we will usually be able to make a list of all the lattices in G (“Margulis 
Arithmeticity Theorem” ). 

This chapter provides a very compressed outline of the material in this 
book. To help keep it brief, let us assume, for the remainder of the chapter, 
that 


G is a noncompact, simple Lie group, and T is a lattice in G. 
This means (see Definition 4.1.9): 
e I’ is a discrete subgroup of G, and 


e the homogeneous space G/T has finite volume (with respect to the Haar 
measure on G). 


(If G/T is compact, which is a very important special case, we say [ is co- 
compact.) 
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Part I. Introduction 


All three chapters in this part of the book are entirely optional; none of the 
material will be needed later (although some examples and remarks do refer 
back to it). Chapters 1 and 2 provide geometric motivation for the study 
of arithmetic groups, by explaining the connection with locally symmetric 
spaces. The present chapter (Chapter 3) is a highly condensed version of the 
entire book. 


Part II. Fundamentals 


This part of the book presents definitions and other foundational material for 
the study of arithmetic groups. 


Chapter 4. Basic Properties of Lattices. This chapter presents a 
few important definitions, including the notions of lattice subgroups, com- 
mensurable subgroups, and irreducible lattices. It also proves a number of 
fundamental algebraic and geometric consequences of the assumption that 
is a lattice, including the following. 


(4.4.4) Recall that an element u of SL(n, R) is unipotent if its characteris- 
tic polynomial is (a — 1)” (or, in other words, its only eigenvalue is 1). If G/T 
is compact, then [’ does not have any nontrivial unipotent elements. This is 
proved by combining the Jacobson-Morosov Lemma (A5.8) with the obser- 
vation that if a sequence c;I° leaves all compact sets, and U is a precompact 
set in G, then, after passing to a subsequence, the sets UciT’, UcoI’,... are all 
disjoint. However, G/T has finite volume, so it cannot have infinitely many 
disjoint open sets that all have the same volume. 


(4.5411) (Borel Density Theorem) T is not contained in any connected, 
proper, closed subgroup of G. Assuming that G/T is compact, the key to 
proving this is to note that if p: G > GL(m,R) is any continuous homomor- 
phism, wu is any unipotent element of G, and v € R™, then the coordinates of 
the vector p(u*)v are polynomial functions of k. However, if G/T is compact, 
and v happens to be p(I)-invariant, then the coordinates are all bounded. 
Since every bounded polynomial is constant, we conclude that every p(I)- 
invariant vector is p(G)-invariant. From this, the desired conclusion follows 
by looking at the action of G on exterior powers of its Lie algebra. 


(4.7.10) [ is finitely presented. When G/T is compact, this follows from 
the fact that the fundamental group of any compact manifold is finitely pre- 
sented. For the noncompact case, it follows from the existence of a nice 
fundamental domain for the action of T on G (which will be explained in 
Chapter 19). 
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(4.8.2) (Selberg’s Lemma) T has a torsion-free subgroup of finite index. 
For example, if f = SL(3, Z), then the desired torsion-free subgroup can be 
obtained by choosing any prime p > 3, and taking the matrices in T that are 
congruent to the identity matrix, modulo p. 


(4.8#9) T is residually finite. For example, if f = SL(3,Z), then no 
nontrivial element of [I is in the intersection of the finite-index subgroups 
used in the preceding paragraph’s proof of Selberg’s Lemma. 


(4.9.2) (Tits Alternative) [ contains a nonabelian free subgroup. This 
is proved by using the Ping-Pong Lemma (4.9.6), which, roughly speaking, 
states that if homeomorphism a contracts all of the space toward one point, 
and homeomorphism b contracts all of the space toward a different point, then 
the group generated by a and 6 is free. 


(4.10.3) (Moore Ergodicity Theorem) If H is any noncompact, closed sub- 
group of G, then every real-valued, H-invariant, measurable function on G/T 
is constant (a.e.). The general case will be proved in Section 11.2, but sup- 
pose, for example, that G = SL(2,R), H = {a*} is the group of diagonal 
matrices, and f is an H-invariant function that, for simplicity, we assume is 
uniformly continuous. If we let {u’} be the group of upper-triangular matrices 
with 1’s on the diagonal, then we have 


ub-f=uta®-f=atue *-f SF asuo- f= f, 
so f is invariant under {u‘}. Similarly, it is also invariant under the group of 
lower-triangular matrices. So f is G-invariant, and therefore constant. 


Chapter 5. What is an Arithmetic Group? Roughly speaking, an 
arithmetic subgroup Gz of G is obtained by embedding G in some SL(@,R), 
and taking the resulting set of integer points of G. That is, Gz is the intersec- 
tion of G with SL(¢, Z). However, in order for Gz to be called an arithmetic 
subgroup, the embedding G — SL(¢, Z) is required to satisfy a certain tech- 
nical condition (“defined over Q”) (see Definition 5.1.2). 


(5.1.11) Every arithmetic subgroup of G is a lattice in G. This funda- 
mental fact will be proved in Chapters 7 and 19. 


(5.2.1) (Margulis Arithmeticity Theorem) Conversely, if G is neither 
SO(1,n) nor SU(1,n), then every lattice in G is an arithmetic subgroup. 
Therefore, in most cases, “arithmetic subgroup” is synonymous with “lat- 
tice? This amazing theorem will be proved in Section 16.3. 

It is a hugely important result. The definition of “lattice” is quite ab- 
stract, but a fairly explicit list of all the lattices in G can be obtained by 
combining this theorem with the classification of arithmetic subgroups that 
will be given in Chapter 18. 


(5.3.1) (Godement Compactness Criterion) G/Gz is compact if and only 
if the identity element is the only unipotent element of Gz. The direction 
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(=) is very elementary and was proved in the previous chapter (see 4.4.4). 
The converse uses the same main idea, combined with the simple observation 
that if a polynomial has integer coefficients, and all of its roots are close to 1, 
then all of its roots are exactly equal to 1. 


(5.5) The embedding of G in SL(@,R) is not at all unique, and different 
embeddings can yield quite different arithmetic subgroups Gz. One very im- 
portant method of constructing non-obvious embeddings is called Restriction 
of Scalars. It starts by choosing a field F' that is a finite extension of Q. If we 
think of F’ as a vector space over Q, then it can be identified with some Q”, 
in such a way that the ring O of algebraic integers of F' is identified with Z”. 
This implies that the group Go is isomorphic to GZ, where G’ is a semisimple 
group that has G as one of its factors. Therefore, this method allows arith- 
metic subgroups to be constructed not only from ordinary integers, but also 
from algebraic integers. 


Chapter 6. Examples of Arithmetic Groups. This chapter ex- 
plains how to construct many arithmetic subgroups of SL(2,R), SO(1, 7), 
and SL(n,R), by using unitary groups and quaternion algebras (and other di- 
vision algebras). (Restriction of scalars is also used for some of the cocompact 
ones.) It will be proved in Chapter 18 that these fairly simple constructions 
actually produce all of the arithmetic subgroups of these groups. 

(6.5) There exist non-arithmetic lattices in SO(1,n) for every n. This 
was proved by M.Gromov and I. Piatetski-Shapiro. They “glued together” 
two arithmetic lattices to create a “hybrid” lattice that is not arithmetic. 


Chapter 7. SL(n, Z) is a lattice in SL(n,R). This chapter explains 
two different proofs of the fundamental fact (already mentioned in Theo- 
rem 5.1.11) that Gz is a lattice in G, in the illustrative special case where 
G = SL(n,R) and Gz = SL(n, Z). 

The first proof is quite short and elementary, and is presented fairly com- 
pletely. It constructs a nice set that is (approximately) a fundamental domain 
for the action of [ on G. The key notion is that of a Szegel set. We begin 
with the Iwasawa decomposition G = KAN. 


e kK =SO(n) is a maximal compact subgroup of G. 


e The group A of diagonal matrices in G is isomorphic to R"~!, so we can 
think of it as a real vector space. Under this identification, the “simple 
roots” are linear functionals a1,...,Q@n,—, on A. Choose any t € R, and 
let 


A, ={a€A]|a;(a) >t for all 7}, 
so A; is a polyhedral cone in A. 
e N is the group of upper-triangular matrices with 1’s on the diagonal. 


e Choose any compact subset No of N. 
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Then the product 8 = No A; K is a Siegel set (see Section 7.2). It depends 
on the choice of t and No. 

A straightforward calculation shows that every Siegel set has finite volume 
(see Proposition 7.2.5). It is also not terribly difficult to find a Siegel set 8 
with the property that Gz-S = G (see Theorem 7.3.1). This implies that 
G/Gz has finite volume, so Gz is a lattice in G, or, in other words, SL(n, Z) 
is a lattice in SL(n, R). 

Unfortunately, some difficulties arise when generalizing this method to 
other groups, because it is more difficult to use Siegel sets to construct an 
appropriate fundamental domain in the general case. The main ideas will be 
explained in Chapter 19. 


So we also present a different proof that is much easier to generalize 
(see Section 7.4). Namely, the general case is quite easy to prove if one 
accepts the following key fact that was proved by Margulis: If 


e u’ is any unipotent 1-parameter subgroup of SL(n,R), and 
e x € SL(n,R)/SL(n, Z), 


then there is a compact subset C of SL(n,R)/SL(n, Z), and some € > 0, such 
that at least €% of the orbit {u’x}:er is in the set C (see Theorem 7.4.7). 


Part III. Important Concepts 


This part of the book explores several fundamental ideas that are important 
not only for their applications to arithmetic groups, but much more generally. 


Chapter 8. Real rank. This chapter defines the real rank of G, which is 
an important invariant in the study of semisimple Lie groups. It also describes 
some consequences of assuming that the real rank is at least two, and presents 
the definition and basic structure of the minimal parabolic subgroups of G. 


Chapter 9. Q-rank. This chapter, unlike the others in this part of the 
book, discusses a topic that is primarily of interest in the theory of arithmetic 
groups (and related algebraic groups). Largely parallel to Chapter 8, it defines 
the Q-rank of I’, describes some consequences of assuming that the Q-rank is 
at least two, and presents the definition and basic structure of the minimal 
parabolic Q-subgroups of G. 
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Chapter 10. Quasi-isometries. Any finite generating set S for I 
yields a metric dg on I: the distance from x to y is the minimal number 
of elements of S that need to be multiplied together to obtain x~'y. Unfortu- 
nately, this “word metric” is not canonical, because it depends on the choice 
of the generating set S. However, it is well-defined up to a bounded factor, so, 
to get a geometric object that is uniquely determined by I’, we consider two 
metric spaces to be equivalent (or quasi-isometric) if there is a map between 
them that only distorts distances by a bounded factor (see Definition 10.1.3). 


(10.1.7) Some quasi-isometries arise from cocompact actions: it is not 
difficult to see that if acts cocompactly, by isometries on a (nice) space X, 
then there is a quasi-isometry from I to X. Thus, for example, any cocompact 
lattice in SO(1,n) is quasi-isometric to the hyperbolic space $”. 


(10.2) Tis Gromov hyperbolic if and only if rankg G = 1 and T is compact, 
except that all lattices in SL(2,R) are hyperbolic, not only the cocompact 
ones. One direction is a consequence of the well-known fact that Z x Z is not 
contained in any hyperbolic group. The other direction (for the cocompact 
case) is a special case of the fact that the fundamental group of any closed 
manifold of strictly negative sectional curvature is hyperbolic. 


Chapter 11. Unitary representations. This chapter presents some 
basic concepts in the theory of unitary representations, the study of group 
actions on Hilbert spaces. The Moore Ergodicity Theorem (4.10.3) is proved 
in Section 11.2, and the “induced representations” defined in Section 11.3 
will be used in Section 13.4 to prove that [ has Kazhdan’s Property (7) if 
rankp G > 2. 


(11.2.2) (Decay of matrix coefficients) If 7 is a continuous homomorphism 
from G to the unitary group of a Hilbert space H, then 


lim (r(g)¢|w) =0, for all 6,w EH. 


Il g|| +00 
This yields the Moore Ergodicity Theorem (4.10.3) as an easy corollary, and 
the proof is based on the existence of a € G and (unipotent) subgroups UT 
and U~ of G, such that (Ut,U~) = G and a™ua~”" — e€ as n — 00 (or —o0), 
for all u € U* (or U~, respectively). 


(11.4.2) Every unitary representation of any compact Lie group is a direct 
sum of finite-dimensional, irreducible unitary representations. 


(11.5.3) Every unitary representation of any abelian Lie group is a direct 
integral of one-dimensional unitary representations. 


(11.6.4) Every unitary representation of G is a direct integral of irreducible 
unitary representations. 
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Chapter 12. Amenable Groups. Amenability is such a fundamental 
notion that it has very many quite different definitions, all of which determine 
exactly the same class of groups (see Definition 12.1.3 and Theorem 12.3.1). 
One useful choice is that a group A is amenable if every continuous action 
of A on a compact, metric space has a finite, invariant measure. 


(12.4.2) The fact that the lattice [ contains a nonabelian free subgroup 
(see Corollary 4.9.2) implies that it is not amenable. This is because subgroups 
of amenable groups are amenable (see Proposition 12.2.8), and free groups do 
have actions (such as the actions described in the Ping-Pong Lemma (4.9.6)) 
that do not have a finite, invariant measure. 


Even so, amenability plays an important role in the study of I, through 
the following observation: 


(12.6.1) (Furstenberg Lemma) If P is an amenable subgroup of G, and we 
have a continuous action of on some compact, metric space X, then there 
exists a measurable, [-equivariant map from G/P to the space Prob(X) of 
measures ps on X, such that u(X) = 1. To prove this, let F be the set of 
measurable, [-equivariant maps from G to Prob(X). With an appropriate 
weak topology, this is a compact, metrizable space, and P acts on it by 
translation on the right. Since P is amenable, there is a P-invariant, finite 
measure 44 on F. The barycenter of this measure is a fixed point of P in J, 
and this fixed point is a function on G that factors through to a well-defined 
T-equivariant map from G/P to Prob(X). 


Chapter 13. Kazhdan’s Property (T). To say [ has Kazhdan’s 
property (7) means that if a unitary representation of does not have any 
(nonzero) vectors that are fixed by I, then it does not almost-invariant vec- 
tors, that is, vectors that are moved only a small distance by the elements of 
any given finite subset of [ (see Definition 13.1.1). 


(13.1.5) Kazhdan’s property (7) is, in a certain sense, the antithesis of 
amenability: a discrete group cannot have both properties unless it is finite. 
This is because the regular representation of any amenable group has almost- 
invariant vectors. 


(13.1.7) Every discrete group with Kazhdan’s property (7) is finitely gen- 
erated. To see this, let H = @,, £L°(A/F), where F ranges over all the finitely 
generated subgroups of A. Then, by construction, every finite subset of A 
fixes some nonzero vector in H. 


(13.2.4) G has Kazhdan’s property (7), unless G is either SO(1,n) or 
SU(1,n). To prove this for G = SL(3, R), first note that the semidirect prod- 
uct SL(2,R) x R? can be embedded in G. Also note that there are elements a 
and 6 of SL(2,R), such that, if Q is any of the 4 quadrants of R?, then either 
aQ or b@ is disjoint from Q (except for the 0 vector). Applying this to the 
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Pontryagin dual of R? implies that if a representation of the semidirect prod- 
uct SL(2, IR) x R? has almost-invariant vectors, then it must have a nonzero 
vector that is invariant under R?. This vector must be invariant under all of 
SL(3,R), by a generalization of the Moore Ergodicity Theorem that is called 
the Mautner phenomenon (11.2.8). 

(13.4.1) [ has Kazhdan’s property (7), unless G is either SO(1,7) or 
SU(1,n). Any unitary representation 7 of [ can be “induced” to a represen- 
tation @ of G. If z has almost-invariant vectors, then the induced represen- 
tation has almost-invariant vectors, and, since G has Kazhdan’s property (T), 
this implies that 7¢ has G-invariant vectors. Any such vector must come from 
a [T-invariant vector in 7. 


(13.5.4) A group has Kazhdan’s property (7) if and only if every action of 
the group by (affine) isometries on any Hilbert space has a fixed point. This 
is not at all obvious, but here is the proof of one direction. 

Suppose [ does not have Kazhdan’s property (7), so there exists a unitary 
representation of I on some Hilbert space H that has almost-invariant vectors, 
but does not have invariant vectors. Choose an increasing chain F C Fy C--- 
of finite subsets whose union is all of [. Since H has almost-invariant vectors, 
there exists a unit vector v, € H, such that || fu, —vp|| < 1/2” for all f € Fy. 
Now, define a: T > H°® by 


Ode = n(gvn — Un). 
Then a is a 1-cocycle, so defining g*v = gv+a(qg) yields an action of T on the 
Hilbert space H°. Since H has no nonzero invariant vectors, it is not difficult 
to see that a is an unbounded function on I’, so a is not a coboundary. This 
implies that the corresponding action on H™ has no fixed points. 


Chapter 14. Ergodic Theory. Ergodic Theory can be defined as the 
measure-theoretic study of group actions. In this category, the analogue of the 
transitive actions are the so-called ergodic actions, for which every measurable, 
invariant function is constant (a.e.) (see Definition 14.2.1). 

(14.3.2) (Pointwise Ergodic Theorem) If Z acts ergodically on X, with 
finite invariant measure, and f is any L!-function on X, then the average of f 
on almost every Z-orbit is equal to the average of f on the entire space X. 

(14.4.3) Every measure-preserving action of G can be measurably decom- 
posed into a union of ergodic actions. 

(14.5.10) If the action of G on a space X is ergodic, with a finite, invariant 
measure, then the action of G on X x X is also ergodic. 


Part IV. Major Results 


Here are some of the major theorems in the theory of arithmetic groups. 
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Chapter 15. Mostow Rigidity Theorem. 


(15.1.2) (Mostow Rigidity Theorem) Suppose T; is a lattice in Gj, for 
i= 1,2, and gy: Ty; ~ To. If G; has trivial center and no compact factors, 
and is not PSL(2,R), then y extends to an isomorphism @: G1 > Go. 

In most cases, the desired conclusion is a consequence of the Margulis 
Superrigidity Theorem, which will be discussed in Chapter 16. However, a 
different proof is needed when G; = Gz = SO(1,n) (and some other cases). 
Assuming that the lattices are cocompact, the proof uses the fact (mentioned 
in Proposition 10.1.7) that [; and Tz are quasi-isometric to 9”. Comparing 
the two embeddings yields a quasi-isometry y from §)” to itself. By proving 
that this quasi-isometry induces a map on the boundary that is conformal 
(i.e., preserves angles), it is shown that the two embeddings are conjugate by 
an isometry of S)”. 


(15.3.6) Mostow’s theorem does not apply to PSL(2,R): in this group, 
there are uncountably many lattices that are isomorphic to each other, but 
are not conjugate. This follows from the fact that there are uncountably 
many different right-angled hexagons in the hyperbolic plane 7. A compact 
surface of genus g can be constructed by gluing 4g — 4 of these hexagons 
together, in such a way that the fundamental group is a cocompact lattice 
in PSL(2,R). The uncountably many different hexagons yield uncountably 
many non-conjugate lattices. 


(15.4.1) From Mostow’s Theorem, we know that lattices in two different 
groups G; and G2 cannot be isomorphic. In fact, the lattices cannot even 
be quasi-isometric. Some ideas in the proof of this fact are similar to the 
argument of Mostow’s theorem, but we omit the details. 


Chapter 16. Margulis Superrigidity Theorem. 


(16.1) (Margulis Superrigidity Theorem) Suppose p: T —+ GL(n,R) is a 
homomorphism. If G is neither SO(1,7) nor SU(1,7), and mild hypotheses 
are satisfied, then p extends to a homomorphism 7: G > GL(n,R). 

Assuming rankpG > 2, a proof is presented in Section 16.5. Start by 
letting H be the Zariski closure of p(T‘), and let Q be a parabolic subgroup 
of H. Furstenberg’s Lemma (12.6.1) provides a T-equivariant map w: G/P > 


Prob(RP”). By using “proximality? ~ can be promoted to a map w :G/A7> 
R” (where A is a maximal R-split torus of G). Thus, we have an A-invariant 
(measurable) section of the flat vector bundle over G/T that is associated to y. 
Since G is generated by the centralizers of nontrivial, connected subgroups 
of A, this implies there is a finite-dimensional, G-invariant space of sections 
of the bundle, from which it follows that y has the desired extension to a 
homomorphism defined on all of G. 


(16.2.1) This theorem of Margulis is a strengthening of the Mostow Rigid- 
ity Theorem (15.1.2), because the homomorphism p is not required to be an 
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isomorphism. (On the other hand, Mostow’s theorem applies to the groups 
SO(1,n) and SU(1,7), which are not allowed in the superrigidity theorem.) 


(16.2.3) In geometric terms, the superrigidity theorem implies (under mild 
hypotheses) that flat vector bundles over G/T become trivial on a finite cover. 


(16.3) (Margulis Arithmeticity Theorem) If G is neither SO(1,n) nor 
SU(1,n), then the superrigidity theorem implies that every lattice in G is 
an arithmetic subgroup (as was stated without proof in Theorem 5.2.1). 

The basic idea of the proof is that if there is some p(7) with a matrix entry 
that is transcendental, then composing p with arbitrary elements of the Galois 
group Gal(C/Q) would result in uncountably many different n-dimensional 
representations of . Since G has only finitely many representations of each 
dimension, this would contradict superrigidity. Thus, we conclude that p(T) C 
GL(n,Q). By using a p-adic version of the superrigidity theorem, Q can be 
replaced with Z. 

(16.8) For groups of real rank one, the proof of superrigidity described in 
Section 16.5 does not apply, because A does not have any nontrivial, proper 
subgroups. Instead, a more geometric approach is used (but only a brief sketch 
will be provided). Let X and Y be the symmetric spaces associated to G 
and H, respectively, where H is the Zariski closure of p(T). By minimizing 
a certain energy functional, one can show there is a harmonic I’-equivariant 
map w: X — Y. Then, by using the geometry of X and Y, it can be shown 
that this harmonic map must be a totally geodesic embedding. This provides 
an embedding of the isometry group of X in the isometry group of Y. In 
other words, an embedding of G in H. 


Chapter 17. Normal Subgroups of I. 


(17.1.1) If rankg G > 2, then T is almost simple. More precisely, every 
normal subgroup of I either is finite, or has finite index. This is proved by 
showing that if N is any infinite, normal subgroup of I, then the quotient 
T'/N is amenable. Since [/N has Kazhdan’s property (7) (because we saw 
in Proposition 13.4.1 that [ has this property), this implies ['/N is finite. 


(17.2.1) On the other hand, if rankg G = 1, then [ is very far from being 
simple — there are many, many infinite normal subgroups of I’. In fact, T° is 
“SQ-universal? which means that if A is any finitely generated group, then 
there is a normal subgroup N of TI’, such that A is isomorphic to a subgroup 
of [/N (see Theorem 17.2.5). 


Chapter 18. Arithmetic Subgroups of Classical Groups. The 
main result of this chapter is the table on page 380 that provides a list of all 
of the arithmetic subgroups of G (unless G is either an exceptional group or 
a group whose complexification G'c is isogenous to SO(8,C)). Inspection of 
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the list establishes several results that were stated without proof in previous 
chapters. 


(18.4) It was stated without proof in Section 6.8 that every arithmetic 
subgroup of SL(n, R) is either a special linear group or a unitary group (if we 
allow division algebras in the construction). The proof of this fact is based on 
a calculation of the group cohomology of Galois groups (or Galois cohomology, 
for short). To introduce this method in a simpler setting, it is first proved that 
the only R-forms of the complex Lie group SL(n, C) are SL(n, R), SL(n/2, H), 
and SU(k, @) (see Section 18.3). 

(18.5) The same methods show that all the Qforms of any classical 
group G are classical groups (except that there is a problem when G is a 
real form of SO(8,C) (see Remark 18.5.10)). However, we do not provide the 
calculations. 


(18.7.4) We say that a semisimple group H = G, x --- x G, is isotypic if 
all the simple factors of Hc are isogenous to each other. A theorem of Borel 
and Harder (18.7.3) on Galois cohomology implies that if H is isotypic, then 
it has an arithmetic subgroup that is irreducible: it is not commensurable to 
a nontrivial direct product [Ty x Iz. (The converse follows from the Margulis 
Arithmeticity Theorem unless H is either SO(1,n) x K or SU(1,n) x K.) 


Chapter 19. Construction of a Coarse Fundamental Domain. 
This chapter presents some of the main ideas involved in the construction of 
a nice subset of G that approximates a fundamental domain for G/T (when [ 
is an arithmetic subgroup). This generalizes the construction for l = SL(n, Z) 
that was explained in Chapter 7. 

As in Chapter 7, the key notion is that of a Siegel set. The main differ- 
ence is that, instead of the maximal R-split torus A, we must work with a 
subtorus T of A that is Q-split, not merely R-split: 


e K is a maximal compact subgroup of G (same as before), 

e Sis a maximal Q-split torus in G, 

e S; is a sector in S, 

e P is a minimal parabolic Q-subgroup of G that contains S$, and 
e Po is a compact subset of P. 


Then K'S;Pp is a Siegel set for T in G (see Definition 19.1.2). 

It may not be possible to find a Siegel set 8, such that 5-T = G (see Ex- 
ample 19.2.1). (When dimS = 1, this is because each Siegel set can only 
cover one cusp, and G/T may have several cusps.) However, there is always a 
finite union of (translates of) Siegel sets that will suffice (see Theorem 19.2.2). 


The existence of a nice set F, such that # - TI = G, has important conse- 
quences, such as the fact that I is finitely presented (see Subsection 19.3(i)). 
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This fact was stated in Theorem 4.7.10, but could only be proved for the 
cocompact case there. 


Chapter 20. Ratner’s Theorems on Unipotent Flows. If {a‘} is 
any l-parameter group of diagonal matrices in G, then there are {a‘}-orbits 
in G/T that have bad closures: the closure is a fractal. M. Ratner proved that 
if a subgroup V is generated by 1-parameter unipotent subgroups, then it is 
much better behaved: the closure of every V-orbit is a C°° submanifold of 
G/T (see Theorem 20.1.3). 


This theorem has important consequences in geometry and number the- 
ory. As a sample application in the theory of arithmetic groups, we mention 
that if [; and Ig are any two lattices in G, then the subset I; [2 of G is either 
discrete or dense (see Corollary 20.2.6). This is proved by letting Tl = Ty x Tp 
in G x G, and letting V be the diagonal embedding of G in the same group. 

Ratner proved that the actions of 1-parameter unipotent subgroups on 
G/T also have nice measurable properties: every finite, invariant probabil- 
ity measure is the Haar measure on a closed orbit of some subgroup of G 
(see Theorem 20.3.4), and every dense orbit is uniformly distributed (see The- 
orem 20.3.3). 


We will not prove Ratner’s theorems, but some of the ideas in the proof 
will be described. One of the main ingredients is called “shearing” (see Sec- 
tion 20.4). For example, suppose G = SL(2,R) and V = {u*} is a 1-parameter 
unipotent subgroup. Then the key point is that if « and y are two nearby 
points in G/T (and are not on the same {u‘}-orbit), then the fastest relative 
motion between the two points is along the V-orbits. More precisely, there is 


some t, such that u's is close to either u’tty or u’ty. 


Appendices 


The main text is followed by three appendices. The first two (appendices A 
and B) recall some facts that are used in the main text. The third (Appen- 
dix C) defines the notion of S-arithmetic group, and quickly summarizes how 
the results on arithmetic groups extend to this more general setting. 


Part II 


Fundamentals 


Chapter 4 


Basic Properties of Lattices 


This book is about lattices in semisimple Lie groups (with emphasis on the 
“arithmetic” ones). 


(4.0.0) Standing Assumptions. Throughout this book: 

1) G is a linear, semisimple Lie group (see Appendix Al 
for an explanation of these terms), with only finitely many 
connected components, and 

2) Tis a lattice in G (see Definition 4.1.9). 


Similar restrictions apply to the symbols G1, G2, G’, Ty, Ta, I”, etc. 


(4.0.1) Remark. Without losing any of the main ideas, it may be assumed, 
throughout, that G is either SL(n,R) or SO(m,n) (or a product of these 
groups), but it is best if the reader is also acquainted with the other “classical 
groups, such as unitary groups and symplectic groups (see Definition A2.1). 


Three definitions in this chapter are very important: lattice subgroups 
(4.1.9), commensurable subgroups (4.2.1), and irreducible lattices (4.3.1 and 4.3.3). 
The rest of the material in this chapter may not be essential for a first read- 
ing, and can be referred back to when necessary. However, if the reader has 
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no prior experience with lattices, then the basic properties discussed in Sec- 
tion 4.1 will probably be helpful. 


§4.1. Definition 


(4.1.1) Lemma. Jf A is a discrete subgroup of G, then there is a strict 
fundamental domain for G/A in G. That is, there is a Borel subset F 
of G, such that the natural map F > G/A, defined by g +> gA, is bijective. 


Proof. Since A is discrete, there is a nonempty, open subset U of G, such 
that (U-1U) NA = {e}. Since G is second countable (or, if you prefer, 
since G is o-compact), there is a sequence {g,,} of elements of G, such that 
Ur 9nU = G. Let 


F= UJ (sts Usa), 
n=1 <n 

Then F is obviously Borel, and it is a strict fundamental domain for G/A 

(see Exercise 2). oO 


(4.1.2) Remark. 


1) The above lemma is stated for the space G/A of left cosets of A, but, 
in some situations, it is more natural to work with the space A\G of 
right cosets. In this book, we will feel free to use whichever is most 
convenient at a particular time, and leave it to the reader to translate 
between the two, by using the fact that the function gA + Ag@! is a 
homeomorphism from G/A to A\G (see Exercises 3 and 4). Our choice 
will usually be determined by the preference for most mathematicians 
to write their actions on the left. (Therefore, if G is acting, then we 
will tend to use G/A, but if we are thinking of A as acting on G, then 
we usually consider the quotient A\G.) 


2) Definitions in the literature vary somewhat, but saying that a subset F 

of Gis a fundamental domain for G/A typically means: 

(a) FA=G, 

(b) F is a closed set that is nice: its interior F is dense in F, and its 

boundary F \ F has measure 0, and 

(c) FANF =9, for all nonidentity \ € A. 
It is not difficult to see that if F is a fundamental domain, then it has 
a Borel subset F’, such that F’ is a strict fundamental domain and 
F ~ F' has measure 0. This means that, for many purposes (such as 
calculating integrals), it suffices to have a fundamental domain, rather 
than finding a set that is precisely a strict fundamental domain. 
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(4.1.3) Proposition. Let A be a discrete subgroup of G, and let u be Haar 
measure on G. There is a unique (up to a scalar multiple) o-finite, G- 
invariant Borel measure v on G/A. More precisely: 


1) For any strict fundamental domain F, the measure v can be defined by 
v(A/A) = WAN F), (4.1.4) 
for every Borel set A in G, such that AA = A. 
2) Conversely, for A CG, we have 
p(A) = #:(AN xA) dv(axA). (4.1.5) 
Gin 


Proof. See Exercises 7 and 8 for (1) and (2). The uniqueness of v follows 
from (2) and the uniqueness of the Haar measure wp. 


(4.1.6) Remark. We always assume that the G-invariant measure v on G/A 
is normalized so that (4.1.4) and (4.1.5) hold. 


(4.1.7) Corollary. Let A be a discrete subgroup of G, and let 6: G > G/A 
be the natural quotient map ¢(g) = gA. If A is a Borel subset of G, such that 
the restriction $|,4 is injective, then v(d(A)) = pA). 

(4.1.8) Remarks. 


1) The Haar measure ys on G is given by a smooth volume form, so the 
associated measure v on G'/A is also given by a volume form. Therefore, 
we say that G/A has finite volume if v(G/A) < co. 


2) The assumption that A is discrete cannot be eliminated from Proposi- 
tion 4.1.3. However, a G-invariant measure on G'/A can be constructed 
under the weaker assumption that A is closed and unimodular (see Ex- 
ercise 9). 

(4.1.9) Definition. A subgroup [ of G is a lattice in G if 
e I is a discrete subgroup of G, and 
e G/T has finite volume. 


(4.1.10) Remark. The definition is not vacuous: we will explain in Corol- 
lary 5.1.16 that G does have at least one lattice (in fact, infinitely many), 
although part of the proof will be postponed to Chapter 7. 


(4.1.11) Proposition. Let A be a discrete subgroup of G, and let 4 be Haar 
measure on G. The following are equivalent: 

1) A is a lattice in G. 

2) There is a strict fundamental domain F for G/A, with u(F) < oo. 

3) There is a strict fundamental domain F' for A\G, with u(F’) < oc. 

4) There is a Borel subset C of G, such that CA =G and p(C) < oo. 
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Proof. (1 = 2) From Equation (4.1.4), we have v(G/A) = u(F). Therefore, 
G/A has finite volume if and only if u(F) < co. 

(2 = 3) If F is any strict fundamental domain for G/A, then F~! is a 
strict fundamental domain for A\G (see Exercise 4). Since G is unimodular, 
we have u(F~') = (F) (see Exercise 5). 

(2 = 4) Obvious. 

(4 => 1) We have Cn xd # 0, for every x € G, so, from (4.1.5), we see 
that 


v(G/A) = i 1ldv(aA) < #(C ON aA)dv(Az) = pw(C)<oco. O 
G/A G/M 


(4.1.12) Example. As mentioned in Example 1.3.7, SL(2,Z) is a lattice in 
SL(2, R). 


(4.1.13) Definition. A closed subgroup A of G is cocompact (or uniform) 
if G/A is compact. 
(4.1.14) Corollary. 

1) Every cocompact, discrete subgroup of G is a lattice. 


2) Every finite-index subgroup of a lattice is a lattice. 


Proof. Exercises 12 and 13. O 


(4.1.15) Remark. Lattices in G are our main interest, but we will occasionally 
encounter lattices in Lie groups H that are not semisimple. If H is unimod- 
ular, then all of the above results remain valid with H in the place of G. In 
contrast, if H is not unimodular, then Proposition 4.1.3 may fail: there may 
exist a discrete subgroup A, such that there is no H-invariant Borel measure 
on H/A. Instead, there is sometimes only a semi-invariant measure V: 
v(hA) = A(h) v(A), 
where A is the modular function of H (see Exercise 14). 


For completeness, let us specifically state the following concrete general- 
ization of Definition 4.1.9 (cf. 4.1.11). 
(4.1.16) Definition. A subgroup A of a Lie group H is a lattice in H if 

e A is a discrete subgroup of H, and 


e there is an H-invariant measure v on H/A, such that v(H/A) < oo. 


(4.1.17) Example. Z” is a cocompact lattice in R”. 
(4.1.18) Proposition. If a Lie group H has a lattice, then H is unimodular. 


Proof. Let ¥ be a strict fundamental domain for H/A. The proof of Propo- 
sition 4.1.3 shows v(A/A) = v(ANF), for every Borel set A in G, such that 
AA = A. Then Exercise 14 implies v(hA/A) = A(h)v(A/A). In particular, 
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we see that v(H/A) = A(h)v(H/A), by letting A = HA (and noting that 


hH = 


H). Since v(H/A) < oo, this implies A(h) = 1, as desired. 


Exercises for §4.1. Recall that (in accordance with the Standing Assump- 
tions (4.0.0)), [ is a lattice in G, and G is a semisimple Lie group. 


#1. 
#2. 


#3. 


4A, 


#5. 


#6. 


HT. 


#8. 


Show that I is finite if and only if G is compact. 


Complete the proof of Lemma 4.1.1; that is, show that F is a strict 
fundamental domain. 


Define f: G/A + A\G by f(gA) = Ag7!. Show that f is a homeomor- 
phism. 


Show Lemma 4.1.1 easily implies an analogous statement that applies 
to right cosets. More precisely, show that if 

e A is a discrete subgroup of G, 

e F isa strict fundamental domain for G/A, and 

eo F ={x'|reF}, 
then the natural map F~' > A\G, defined by g > Ag, is bijective. 


Show that (A~!) = (A) for every Borel subset A of G. 


[Hint: Defining y’(A) = p(A7') yields a G-invariant measure on G. The 
uniqueness of Haar measure implies yu’ = jz. Where did you use the fact that 
G is unimodular?] 


Let 
e A be a discrete subgroup of G, 
e F and F’ be strict fundamental domains for G/A, 
e yw be Haar measure on G, and 
e A be a Borel subset of G. 
Show: 
a) For each g € G, there is a unique \ € A, such that gd € F. 
b) For each \ € A, if we let A, = {a € A| ad © F}, then A) is 
Borel, and A is the disjoint union of the sets { A, |A€ A}. 
c) WF) = wl"). 
d) If AA = A, then w(ANF) = wW(ANF’). 


Show, for every Haar measure py on G, that the Borel measure v defined 
in Proposition 4.1.3(1) is G-invariant. 
(Hint: For any g € G, the set gF is a strict fundamental domain. From 


Exercise 6(d), we know that v is independent of the choice of the strict 
fundamental domain F.| 


If A is a discrete subgroup of G, and v is a o-finite, G-invariant Borel 
measure on G'/A, show that the Borel measure jz defined in Proposi- 
tion 4.1.3(2) is G-invariant. 
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Let H be a closed subgroup of G. Show that there is a o-finite, G- 
invariant Borel measure v on G/H if and only if H is unimodular. 


[Hint: (=) For a left Haar measure p on H, define a left Haar measure pw 
on G by 


p(A) =f Pe ANH) dv(xH). 


Then w(A) = Ax(h) u(Ah) for h € H, where Ay is the modular function 
of H. Since G is unimodular, we must have Ay = 1.] 


Show that if A is a discrete subgroup of G that contains I, then A isa 
lattice in G, and T has finite index in A. 


[Hint: Let F be a strict fundamental domain for G/A, and let F' be a set of 
coset representatives for T in A. Then ¥- F is a strict fundamental domain 
for G/T, and therefore has finite measure.| 


Let A be a discrete subgroup of G. Show that a subset A of G/A is 
precompact if and only if there is a compact subset C’ of G, such that 
ACCA/A. 


[Hint: (<=) The continuous image of a compact set is compact. (=) Let U 
be a cover of G by precompact, open sets.] 


Prove Corollary 4.1.14(1). 
[Hint: Exercise 11 and Proposition 4.1.11(4).] 


Prove Corollary 4.1.14(2). 


[Hint: Proposition 4.1.11. A finite union of sets of finite measure has finite 
measure. | 


Let 

e H bea Lie group, 

e A be a discrete subgroup of A, 

e 4 be the right Haar measure on H, and 

e F be a strict fundamental domain for H/A. 
Define a o Borel measure v on H/A by v(A/A) = u(AN F), for every 
Borel set A in H, such that AA = A. Show v(hA/A) = A(h) v(A/A), 
where A is the modular function of H. 
[Hint: Cf. Exercise 7.] 


Show that every discrete, cocompact subgroup of every Lie group is a 
lattice. 


[Hint: Define v as in Exercise 14. Since v(H/A) < co (why?), we must have 
Aw) = 1, 


§4.2. Commensurability and isogeny 


We usually wish to ignore the minor differences that come from passing to a 
finite-index subgroup. The following definition describes the resulting equiv- 
alence relation. 
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(4.2.1) Definition. We say that two subgroups A; and Ag of a group H are 
commensurable if A, Aq is a finite-index subgroup of both A; and Ag. This 
is an equivalence relation on the collection of all subgroups of H (see Exer- 
cise 1). 

(4.2.2) Examples. 

1) Two cyclic subgroups aZ and bZ of R are commensurable if and only 
if a is a nonzero rational multiple of b; therefore, commensurability of 
subgroups generalizes the classical notion of commensurability of real 
numbers. 


2) It is easy to show that every subgroup commensurable to a lattice is 
itself a lattice. (For example, this follows from Corollary 4.1.14(2) and 
Exercise 4.14410.) 


The analogous notion for Lie groups (with finite center and finitely many 
connected components) is called “isogeny:” 


(4.2.3) Definitions. 
1) G, is isogenous to G2 if some finite cover of (G;)° is isomorphic to 
some finite cover of (G2)°. This is an equivalence relation. 
2) A (continuous) homomorphism y: G; > G2 is an isogeny if it is an 
isomorphism modulo finite groups. More precisely: 
e the kernel of ¢ is finite, and 
e the image of y has finite index in Go. 


(4.2.4) Remark. The following are equivalent: 
1) G, is isogenous to Go. 
2) Ad(G1)° & Ad(G2)°. 
3) G, and G2 are locally isomorphic, that is, the Lie algebras g; and ge 
are isomorphic. 


4) There is an isogeny from some finite cover of (G,)° to Go. 


The normalizer of a subgroup is very important in group theory. Because 
we are ignoring finite groups, the following definition is natural in our context. 


1 


(4.2.5) Definition. An element g of G commensurates T if gl'g~* is com- 


mensurable to [’. Let 
Comme(L) = {g € G | g commensurates T }. 


This is called the commensurator of T. 


(4.2.6) Remark. The commensurator of [is sometimes much larger than the 
normalizer of [. For example, let G = SL(n,R) and [ = SL(n,Z). Then 
Nec(L) is commensurable to [ (see Corollary 4.5.5), but Commeg(I) contains 
SL(n, Q)(see Exercise 4.84411), so Commg(I) is dense in G, even though 
Ne(L) is discrete. Therefore, in this example (and, more generally, whenever 
I is “arithmetic” ), NG(T) has infinite index in Commg(L). 
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On the other hand, if G = SO(1,n), then it is known that there are 
examples in which T, NcG(0), and Commg(I) are commensurable to each 
other (see Exercise 5.2#3 and Corollary 6.5.16). 


(4.2.7) Definition. We say that two groups A; and A» are abstractly com- 
mensurable if some finite-index subgroup of A, is isomorphic to some finite- 
index subgroup of Ag. 


Note that if A; and Ag are commensurable, then they are abstractly 
commensurable, but not conversely. 


Exercises for §4.2. 


#1. Verify that commensurability is an equivalence relation. 


#2. If [; is commensurable to Iz, show Commg(T)) = Comme (F2). 


§4.3. Irreducible lattices 


Note that [, x Ig is a lattice in G, x Gp. A lattice that can be decomposed 
as a product of this type is said to be reducible. 


(4.3.1) Definition. [ is irreducible if .N is dense in G, for every noncom- 
pact, closed, normal subgroup N of G° (and I is infinite, or, equivalently, 
G is not compact). 


(4.3.2) Example. If G is simple (and not compact), then every lattice in G 
is irreducible. Conversely, if G is not simple, then not every lattice in G is 
irreducible. To see this, assume, for simplicity, that G is connected and has 
trivial center (and is not compact). Then we may write G as a nontrivial 
direct product G = G; x G2, where each of G; and G2 is semisimple. If we 
let [; be any lattice in G;, for 7 = 1,2, then I’, x [2 is a reducible lattice in G. 


The following proposition shows (under mild assumptions) that every 
lattice is commensurable to a product of irreducible lattices. Therefore, the 
preceding example provides essentially the only way to construct reducible 
lattices, so most questions about lattices can be reduced to the irreducible 
case. We postpone the proof, because it relies on some results from later in 
this chapter. 


(4.3.3) Proposition (see proof on page 57). Assume 
e G has trivial center, and 
e I’ projects densely into the maximal compact factor of G. 


Then there is a direct-product decomposition G = G, x --- x G,, such that T 
is commensurable to Tl, x---xI,, where T; =TOG;, and 1; is an irreducible 
lattice in G;, for each 1. 
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For readers familiar with locally symmetric spaces, these results can be 
restated in the following geometric terms. 


(4.3.4) Definition. Recall that a locally symmetric space ['\X is irreducible 
if there do not exist (nontrivial) locally symmetric spaces [';\X1 and T'2\Xo, 
such that the product ([y\X1) x (('2\X2) finitely covers T\X. 


The following is obvious by induction on dim X. 


(4.3.5) Proposition. There exist locally symmetric spaces T4\X1,...,0-\Xr 
that are irreducible, such that the product ((,\X1) x --: x (T;\X,-) finitely 
covers T\X. 


The following is a restatement of Proposition 4.3.3 (in the special case 
where G has no compact factors). 


(4.3.6) Proposition. Let M be an irreducible locally symmetric space, such 
that the universal cover X of M has no compact factors, and no flat factors. 
For any nontrivial cartesian product decomposition X = X1 x X2 of X, the 
image of X1 is dense in M. 


We will see in Example 5.5.3 that SL(2,R) x SL(2, R) has an irreducible 
lattice (for example, a lattice isomorphic to SL (2, Z[V2])). More generally, 
Corollary 18.7.4 shows that G has an irreducible lattice if all the simple factors 
of the “complexification” of G are isogenous to each other. The converse is 
proved in Theorem 5.6.2, under the additional assumption that G has no 
compact factors. 


Exercises for §4.3. 


#1. Show that if T is irreducible, then I projects densely into the maximal 
compact factor of G. 


§4.4. Unbounded subsets of ['\G 


Geometrically, looking at the fundamental domain described in Example 1.3.7 
makes it clear that the sequence {ni} tends to oo in SL(2, Z)\S. In this sec- 
tion, we give an algebraic criterion that determines whether or not a sequence 
tends to oo in G/T, without any need for a fundamental domain. 

Recall that the injectivity radius of a Riemannian manifold X is the 
maximal r > 0, such that, for every « € X, the exponential map is a dif 
feomorphism on the open ball of radius r around x. If X is compact, then 
the injectivity radius is nonzero. The following proposition shows that the 
converse holds in the special case where X = ['\G/K is locally symmetric of 
finite volume. 
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(4.4.1) Proposition. For g € G, define ¢,: G— G/T by ¢4(x) =agIT. The 
homogeneous space G/T is compact if and only if there is a nonempty, open 
subset U of G, such that, for every g € G, the restriction dg|u of dg to U is 
injective. 

Proof. (=) Define 6: G— G/T by ¢(x) = aI. Then ¢ is a covering map, so, 
for each p € G/T, there is a connected neighborhood V, of p, such that the 
restriction of ¢ to each component of d~'(V,) is a diffeomorphism onto V,. 
Since { V, | p € G/T} is an open cover of G/T’, and G/T is compact, there is 
a connected neighborhood U of e in G, such that, for each p € G/T, there is 
some p’ € G/T, with Up C V, (see Exercise 1). Then ¢,|y is injective, for 
each g € G. 

(<) We prove the contrapositive. Let U be any nonempty, precompact, 
open subset of G. (We wish to show, for some g € G, that ¢g|y is not 
injective.) If C is any compact subset of G/T’, then, because G/T is not 
compact, we have 

(G/T)~ (UC) FO. 
Hence, we may inductively construct a sequence {g,,} of elements of G, such 
that the open sets ¢,,(U),¢9,(U),... are pairwise disjoint. Since G/T has 
finite volume, these sets cannot all have the same volume, so, for some n, the 
restriction @g, |v is not injective (see Corollary 4.1.7). O 


Let us restate this geometric result in algebraic terms. 


(4.4.2) Notation. For elements a and 6 of a group H, and subsets A and B 
of H, let 


’a = bab“, Pa —{'a|be B}, 
"A={alac A}, BA= {9 | ae Abe Bh. 


(4.4.3) Corollary. G/T is compact if and only if the identity element e is 
not an accumulation point of CT. 


Proof. We have 
dglu is injective = fus,u2 €U, wgl =uggF and us 4 u2 
& 9M (U~1U) = {fe}. oO 


This has the following interesting consequence. 


(4.4.4) Corollary. If T has a nontrivial, unipotent element, then G/T is not 
compact. 


Proof. If u is a nontrivial, unipotent element of I’, then, from the Jacobson- 
Morosov Lemma (A5.8), we know there is a continuous homomorphism 


¢: SL(2,R) 3 G, with ¢ F t] ah Lee Es | € G. Then 
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cael olla alo Gal) 
-#([o 71'])>#([0 a) = 


Therefore, e is an accumulation point of “u, so Corollary 4.4.3 implies that 
G/T is not compact. 


(4.4.5) Remarks. 


1) If G has no compact factors, then the converse of Corollary 4.4.4 is 
true. However, we will prove this only in the special case where I is 
“arithmetic” (see Section 5.3). 


2) In general (without any assumption on compact factors), it can be 
shown that G/T is compact if and only if every element of I’ is semisim- 
ple (see Exercise 5.3446). 


The proofs of Proposition 4.4.1 and Corollary 4.4.3 establish the following 
more general version of those results. 


(4.4.6) Proposition (see Exercise 2). Let A be a lattice in a Lie group H, 
and let C be a subset of H. The image of C in H/A is precompact if and only 
if the identity element e is not an accumulation point of CA. 


The following is a similar elementary result that applies to the important 
special case where G = SL(@,R) andl = SL(@, Z), without relying on the fact 
that SL(£, Z) is a lattice. 


(4.4.7) Proposition (Mahler Compactness Criterion). Let C C SL(é,R). 
The image of C in SL(€,R)/SL(¢, Z) is precompact if and only if 0 is not an 
accumulation point of 


CZ’ ={cv| ce CvEeZ}. 


Proof. (=) Since the image of C in SL(¢, R)/SL(¢, Z) is precompact, there is 
a compact subset Co of G, such that C’ C Co SL(¢, Z) (see Exercise 4.1411). 
There is no harm in assuming that C = Co SL(¢,Z) (by enlarging C’). Then 
C(Z‘~ {0}) = Co(Z*‘~ {0}) is closed (since Z‘ \ {0}, being discrete, is closed 
and Cp is compact), so C(Z‘~ {0}) contains all of its accumulation points. In 
addition, since 0 is fixed by every element of C, we know that 0 ¢ C(Z‘~ {0}). 
Therefore, 0 is not an accumulation point of C(Z*‘ \ {0}). 

(<) To simplify the notation (while retaining the main ideas), let us 
assume £ = 2 (see Exercise 6). Suppose {g,} is a sequence of elements 
of SL(2,R), such that 0 is not an accumulation point of U, gnZ?. We 
wish to show there is a sequence {y,,} of elements of SL(2,Z), such that 
{gnYn} has a convergent subsequence. 
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For each n, let 


e vy, € Z ~ {0}, such that ||gnvn|| is minimal, 


e tn: R? — Rgnvn and t+: R? — (Rgnvn)+ be the orthogonal projec- 
tions, and 


e wn € Z? \ Run, such that ||7+(gnwn)|| is minimal. 


By replacing w, with w,+kv,, for an appropriately chosen k € Z, we may as- 
sume ||7n(gnWn)|| < ||GnVn||/2. Then, since the minimality of ||g,v,|| implies 
I|InUn| < |GnWn| , we have 


II9nvnl 
2 


Ine] < [lie Gnwn)Il + [htm (Gnwn)l] < lle (Gnwn)I| + 5 


sO 


Il9nvnl 


lt (Gnwn)|| > (4.4.8) 


Let C’ be the convex hull of {0,vn,w,} and (thinking of v, and wy, as 
column vectors) let 7, = [un Wn| € Matex2(Z). From the minimality of 
l9nUn|| and ||72(gnWn)||, we see that CNZ? = {0, un, Wn} (see Exercise 7), so 
det ¥, = +1 (see Exercise 8). Therefore, perhaps after replacing w,, with —wn, 
we have 7, € SL(2,Z). Since y,[}] = vn and y,[?] = wn, we may assume, 
by replacing gy, with gn7¥n, that 


Al anid wg = [7s 
Note that 
llr (GnWn)I| - Ilgn%nll = det gn = 1. (4.4.9) 


By combining this with (4.4.8), we see that {g,v,} is a bounded sequence, so, 
by passing to a subsequence, we may assume gnUn converges to some vector v. 
By assumption, we have v ¥ 0. 

Now, from (4.4.9), and the fact that |lgnvun|| — ||v|| is bounded away 
from 0, we see that ||7+(gnwn)|| is bounded. Because ||7n(gnWn)|| is also 
bounded, we conclude that ||g,w,|| is bounded. Hence, by passing to a sub- 
sequence, we may assume gy, Wp converges to some vector w. From (4.4.8), we 
know that ||72(gnwn)|| 4 0, so w ¢ Rv. 

Since v # 0 and w ¢ Ru, there is some g € GL(é,R) with g[}] = v and 
g[?] = w. We have 


A 


Galt | = 9nUn 7 V = glo] 
and, similarly, gn[°] > g[?], so gnx > gx for all x € R?. Therefore, gn > 9, 
as desired. Oo 


Exercises for §4.4. 


#1. Suppose a Lie group H acts continuously on a compact topological 
space /, and VY is an open cover of M. Show that there is a neighbor- 
hood U of e in H, such that, for each m € M, there is some V € Y 
with Um CV. 


#2. 


#3. 


4A, 


#5. 


#6. 


#7. 


#8. 


#9. 
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(Hint: This is analogous to the fact that every open cover of a compact 
metric space has a “Lebesgue number” Each m € M is contained in some 
Vin € V. Choose V,, containing m, and a neighborhood Uj, of e, such that 
Ui,Vin GC Vm. Cover M with finitely many V;,,.] 


Prove Proposition 4.4.6. 
[Hint: See the proofs of Proposition 4.4.1 and Corollary 4.4.3.] 


Use Proposition 4.4.6 to show that if H is a closed subgroup of G, such 
that H/T is a lattice in H, then the natural inclusion map H/((TNH) @ 
G/T is proper. 

[Hint: It suffices to show that if C is a subset of H, such that the image of C 
in G/T is precompact, then the image of C in H/(['M H) is also precompact.| 


Let G = SL(2,R), [ = SL(2,Z), and A be the subgroup consisting of 
all the diagonal matrices in G. Show that the natural inclusion map 
A/(T1 NA) @ G/T is proper, but [NA is not a lattice in A. (Therefore, 
the converse of Exercise 3 does not hold.) 


Let G = SL(8,R), A = SL(8,Z), and a = diag(1/2, 1, 2) € H. Show 
that a"A > co in G/A as n > oo. That is, show, for each compact 
subset C' of G/A, that, for all sufficiently large n, we have a"A ¢€ C. 
(For the purposes of this exercise, do not assume that A is a lattice 
in G.) 


Prove Proposition 4.4.7(<) without assuming ¢ = 2. 
[Hint: Extend the definition of vn and wn to an inductive construction of 
vectors U1,n,---,Uen € Z*] 


Suppose that v and w are linearly independent vectors in R’, and x = 
av + bw, with a,b > 0 anda+6< 1. Show that either 

e x € {v, w}, or 

e |[a|] < [ul], or 

e x ¢ Rv and d(z, Rv) < d(w, Rv). 
[Hint: Either b= 1, or b=0, or 0<b< 11 


Let C be the convex hull of {0,v,w}, where v and w are linearly 
independent vectors in Z?. Show that if CN Z? = {0,v,wh, then 
det [v w| Sash: 

[Hint: Let P be the the convex hull of {0,v, w,v + w}, so |det [v w]| is the 


area of P. If this area is > 1, then the translates of P by elements of Z? 
cannot be a tiling of R?.] 


Let H be a closed subgroup of G. 
a) Show that if [1 H is a lattice in H, then HT is closed in G. 
b) Show that the converse holds if H is normal in G. 


[Hint: (a) Exercise 3. (b) Since G/T is a bundle over G/(HT) with fiber 
HT /T, Fubini’s Theorem implies that HT’/T has finite volume. So the H- 
equivariantly homeomorphic space H/(['M H) also has finite volume.| 
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#10. Suppose 
e A is a non-cocompact lattice in a topological group H, and 
e H has a compact, open subgroup K. 
Show that A has a nontrivial element of finite order. 


[Hint: Since K is compact and A is discrete, it suffices to show that some 
conjugate of A intersects K nontrivially.] 


§4.5. Borel Density Theorem and some consequences 


The results in this section require the minor assumption that I projects 
densely into the maximal compact factor of G. This hypothesis is auto- 
matically satisfied (vacuously) if G has no compact factors. Recall that G° 
denotes the identity component of G (see Notation 1.1.2). 


(4.5.1) Theorem (Borel). Assume 


e I’ projects densely into the maximal compact factor of G, 


e V is a finite-dimensional vector space over R or C, and 


e p: G—> GL(V) is a continuous homomorphism. 
Then: 
1) Every p(T)-invariant vector in V is p(G°)-invariant. 


2) Every p(P)-invariant subspace of V is p(G°)-invariant. 


Proof. The proof is not difficult, but, in order to get to the applications more 
quickly, we will postpone it until Section 4.6. For now, to illustrate the main 
idea, let us just prove (1), in the special case where G/T is compact (and G 
is connected). Assume also that G has no compact factors (see Exercise 1); 
then G is generated by its unipotent elements (see Exercise 2), so it suffices to 
show that v is invariant under p(u), for every nontrivial unipotent element u 
of G. Because p(u) is unipotent (see Exercise 3), we know that p(u”)v is a 
polynomial function of n (see Exercise 4). However, because G/T is compact 
and p([)uv = v, we also know that p(G)v is compact, so { p(u”)v | n € N} 
is bounded. Every bounded polynomial is constant, so we conclude that 
p(u™)v = v for all n; in particular, p(u)v = p(u')u = v, as desired. O 


(4.5.2) Corollary. Assume T projects densely into the maximal compact fac- 
tor of G. If H is a connected, closed subgroup of G that is normalized by T, 
then H is normal in G°. 


Proof. The Lie algebra § of H is a vector subspace of the Lie algebra g of G. 
Also, because I. normalizes H, we know that § is invariant under AdgI. 
From Theorem 4.5.1(2), we conclude that § is invariant under AdG®°. Since 
H is connected, this implies that H is a normal subgroup of G°. L] 
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(4.5.3) Corollary. If T projects densely into the maximal compact factor of G 
(and G is connected), then Cg(T) = Z(G). 


Proof. Recall that G C SL(@,R), for some @ (see the Standing Assump- 
tions (4.0.0)). Let V = Maty,¢(IR) be the vector space of all real ¢ x £ matrices, 
soG CV. For g €G and vé€ V, define p(g)v = gug~!, so p: G > GL(V) is 
a continuous homomorphism. If c € Cg(I'), then p(y)c = yey! = c for every 
y €T, so Theorem 4.5.1(1) implies that p(G)c = c. Therefore c € Z(G). 


(4.5.4) Corollary. Assume I projects densely into the maximal compact fac- 
tor of G (and G is connected). If N is a finite, normal subgroup of T, then 
NC ZG). 


Proof. The quotient ['/ Cr(JV) is finite, because it embeds in the finite group 
Aut(N), so Cr(NV) is a lattice in G (see Corollary 4.1.14(2)). Then, because 
N CCa(Cr(N)), Corollary 4.5.3 implies N C Z(G). 


(4.5.5) Corollary. If T projects densely into the maximal compact factor 
of G, then T has finite index in its normalizer Ng(T). 


Proof. By passing to a subgroup of finite index, we may assume G is con- 
nected. Because I is discrete, the identity component NG(L)° of Nc¢(T) must 
centralize T. So Ne(T)° € Ce(L) = Z(G) is finite. On the other hand, 
Nea(T)° is connected. Therefore, NG(T)° is trivial, so NG(P) is discrete. 
Hence [ has finite index in Nc(L) (see Exercise 4.1410). 


(4.5.6) Corollary (Borel Density Theorem). Jf T projects densely into the 
maximal compact factor of G (and G is connected), then T is Zariski dense 
inG. That is, ifQ € Rivia,..., xe] is a polynomial function on Mate ¢(R), 
such that Q(T) = 0, then Q(G) = 0. 


Proof. Let 


= {Q E R[r1,1,---, Lee] | Q(T) = Ot. 
From the definition of Q, it is obvious that [ is contained in the corresponding 
variety Var(Q) (see Definition A4.1). Since Var(Q) has only finitely many 
connected components (see Theorem A4.6), this implies that Var(Q)° is a 
connected subgroup of G that contains a finite-index subgroup of IT’. Hence 
Corollary 4.5.2 implies that Var(Q)° = G (see Exercise 11), so G C Var(Q), 
as desired. 


With the above results, we can now provide the proof that was postponed 
from page 50: 


Proof of Proposition 4.3.3. We may assume [ is reducible (otherwise, 
let r = 1). Hence, there is some noncompact, connected, closed, normal 
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subgroup N of G, such that NT is not dense in G; let H be the closure of 
NT, and let H, = H°. Because [ C H, we know that I normalizes H,, so 
H is a normal subgroup of G (see Corollary 4.5.2 and Exercise 4.3#1)). 

Let Ay = H, NT. By definition, H; is open in H, so the subgroup Ai. 
is also open in H. It is therefore closed, so Aj is a lattice in H, (see Exer- 
cise 4.4449). 

Because H; is normal in G and G is semisimple (with trivial center), there 
is a normal subgroup H2 of G, such that G = H, x Ho (see Exercise A1#6). 
Let A = Hi 1 (H2I) be the projection of [ to Hy. Now I normalizes Aj, and 
Hz centralizes A,, so A must normalize A,. Therefore Corollary 4.5.5 implies 
that A is discrete (hence closed), so H2l = A x Hg is closed, so Ag = H2NT 
is a lattice in Hz (see Exercise 4.4#9). 

Because A, is a lattice in H, and Ag is a lattice in H2, we know that 
A, xX Ag is a lattice in Hy; x Hj = G. Hence, A; x Ag has finite index in I 
(see Exercise 4.1410). 

By induction on dim G, we may write 


Ay =G\ yn x G, and Ay = Goai1 Ress ye On 
so that MG; is an irreducible lattice in G;, for each 7. | 


(4.5.7) Remark. For simplicity, the statement of Theorem 4.5.1 assumes that 
T projects densely into the maximal compact factor of G. Without this as- 
sumption, the proof of Theorem 4.5.1(1) establishes the weaker conclusion 
that v is p(S)-invariant, for every noncompact, simple factor S of G. This 
leads to alternate versions of the corollaries that make no assumption about 
the compact factor of G. For example, the analogue of Corollary 4.5.2 states 
that if H is a connected, closed subgroup of G that is normalized by I, then 
H is normalized by every noncompact, simple factor of G. 


Exercises for §4.5. 


#1. Prove 4.5.1(1), under the assumption that G/T is compact (but allowing 
G to have compact factors). 


[Hint: The above proof shows that v is invariant under the the product N of 
all the noncompact factors of G. So it is invariant under the closure of NT, 
which is G.] 


#2. Show that if G is connected, and has no compact factors, then it is 
generated by its unipotent elements. 


[Hint: Consider each simple factor of G individually. The conjugates of a 
unipotent element are also unipotent.| 


#3. Suppose p: G — SL(£,R) is a continuous homomorphism. Show that if 
u is unipotent, then p(u) is unipotent. 


[Hint: The Jacobson-Morosov Lemma (A5.8) allows you to assume G = 


SL(2,R) and u = E tl . Then a sequence of conjugates of wu converges to Id, 


4A, 


#5. 
#6. 


HT. 


#8. 


#9. 


4£10. 


#11. 


#12. 


#13. 


4614. 
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so the characteristic polynomial of p(w) is the same as the characteristic 
polynomial of Id.] 


Show that if u is a unipotent element of SL(¢,R) and v € R*, then each 
coordinate of the vector u” v is a polynomial function of n. 

[Hint: Let u = Id+T, where T**? = 0. Then u®v = (Id4+T)"v = 
ae. (,) Tv] 


Show that if G is not compact, then [I is not abelian. 


Generalizing Exercise 5, show that if G is not compact, then T' is not 
solvable. 


Strengthening Exercise 5, show that if G is not compact, then the com- 
mutator subgroup [I I] is infinite. 


Assume the hypotheses of Theorem 4.5.1, and that G is connected. For 
definiteness, assume that V is a real vector space. For any subgroup H 
of G, let V|H] be the R-span of { p(h) | h € H} in End(V). Show that 
VIT] = V[G). 


Show the identity component of NG(I) is contained in the maximal 
compact factor of G. 


[Hint: Apply Corollary 4.5.5 to G/K, where K is the maximal compact 
factor. ] 


Show that if G is not compact, then I has an element that is not 
unipotent. 


[Hint: Any unipotent element ¥ of SL(é,R) satisfies the polynomial (~—1)* = 
0.] 


Assume G has no compact factors. Show that if H is a connected, 
closed subgroup of G that contains a finite-index subgroup of I, then 
5 i Oe 

[Hint: H is normalized by [1M H, so H «G°.] 


Assume G' has trivial center and no compact factors. Show that I is 
reducible if and only if there is a finite-index subgroup I’ of I’ such that 
I’ is isomorphic to A x B, for some infinite groups A and B. 
(Actually, although you do not need to prove it, there is no need to assume 
the center of G is trivial. This is because [ has a subgroup of finite index 
that is torsion free (see Theorem 4.8.2), and therefore does not intersect the 
center of G.) 


Show that if T is irreducible, then N OT is finite, for every connected, 
closed, normal subgroup N of G, such that G/N is not compact. 
[Hint: See the proof of Proposition 4.3.3.] 


Let p; and p2 be finite-dimensional, real representations of G. Assume 
G is connected, and has no compact factors. Show that if the restric- 
tions pi|p and p2|p are isomorphic, then p; and p2 are isomorphic. 
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[Hint: We are assuming p;: G — GL(n,R), for some n, and that there is 
some A € GL(n,R), such that pi(g) = Ap2(g)A~', for all y € T. You wish 
to show there is some A’ € GL(n,R), such that the same condition holds for 
all g € G, with A’ in the place of A. The Borel Density Theorem implies 
that you may take A’ = A.] 


§4.6. Proof of the Borel Density Theorem 


The proof of the Borel Density Theorem (4.5.1) is based on the contrast 
between two behaviors. On the one hand, if u is a unipotent matrix, and v is 
a vector that is not fixed by u, then some component of uw” v is a nonconstant 
polynomial, and therefore tends to +oo with n. On the other hand, the 
following observation implies that if v is fixed by [, then some subsequence 
converges to a finite limit. 


(4.6.1) Lemma (Poincaré Recurrence Theorem). Let 
e (X,d) be a metric space, 
e T: X > X be a homeomorphism, and 
e yu be a T-invariant measure on X, such that u(X) < oo. 


Then, for almost every x € X, there is a sequence nz — 00, such that T"*x > 
i 


Proof. Let 
A-={aEX|Vm>0, d(Tz,x) >}. 
It suffices to show p(A_) = 0 for every e. 

Suppose 4(A.) > 0. Then we may choose a subset B of A,, such that 
u(B) > 0 and diam(B) < ¢«. Because the sets B,T~'B,T~*B,... all have 
the same measure, and p(X) < oo, they cannot all be disjoint: there exists 
m <n, such that T~"BOT—"B 4 0. By applying T”, we may assume n = 0. 
For x € T~-™ BOB, we have Tx € B and x € B, so 

d(T’ x, x) < diam(B) <e. 
This contradicts the definition of A,. | 


(4.6.2) Remark. Part (1) of Theorem 4.5.1 is a corollary of Part (2). Namely, 
if v is p()-invariant, then the 1-dimensional subspace Rv (or Cv) is also 
invariant, so (2) implies that the subspace is p(G)-invariant. Since G has no 
nontrivial homomorphism to the abelian group R* (or C%), this implies that 
the vector v is p(G)-invariant. 


However, we will provide a direct proof of (1), since it is quite short (and 
a little more elementary than the proof of (2)). 


Proof of Theorem 4.5.1(1). Suppose v is a vector in V that is fixed by 
p(L). It suffices to show, for every unipotent u € G, that v is fixed by p(w). 
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Since u is p(I)-invariant, the map p induces a well-defined map p: G/T > 
V, defined by p(gl') = p(g)v. Since p is G-equivariant, it pushes the G- 
invariant, finite measure v on G/T to a p(G)-invariant, finite measure 7 on V. 
Therefore, Lemma 4.6.1 implies, for a.e. g € G, that {p(u”g)v} has a conver- 
gent subsequence. 

However, each component of the vector p(u"g)v is a polynomial function 
of n (see Exercise 4.5444). Therefore, the preceding paragraph implies, for 
ae. g € G, that p(u"g)v is constant (independent of n). This means that 
p(g)v is fixed by p(u). Since this is true for a.e. g, we conclude, by continuity, 
that it is true for all g, including g = e. Hence, v is fixed by wu. 


To prepare for the proof of Theorem 4.5.1(2), we make a few observations 
about the action of G on the projectivization of V. 


(4.6.3) Proposition. Assume 


e G has no compact factors, 


e V is a finite-dimensional vector space over R or C, 

e p:G—>GL(V) is a continuous homomorphism, and 

e 4 is a p(G)-invariant measure on the projective space P(V). 
If u(P(V)) < oo, then ys is supported on the set of fired points of p(G°). 
Proof. We know that G®° is generated by its unipotent elements (see Exer- 
cise 4.5##2), so it suffices to show that ju is supported on the set of fixed points 


of p(u), for every unipotent element u of G. 
Let 


e u bea unipotent element of G, 
e T = p(u) — Id, and 
e vEV\ {0}. 
Then T is nilpotent (because p(w) is unipotent (see Exercise 4.5443) ), so there 
is some integer r > 0, such that T’v 4 0, but T’*'v = 0. We have 
e(u)Tv = (Id+T)(T'v) = Tv + Ty = TV +0=T'D, 
so [Tv] € P(V) is a fixed point for p(w). Also, for each n € N, we have 


noorsi= Be ()r]-[(0) EG) 


k=0 
(because, for k < r, we have (7)/(") + 0 as n + oo). Therefore, p(u”)[v] 
converges to a fixed point of p(w), as n > oo. 

The Poincaré Recurrence Theorem (4.6.1) implies, for ji-almost every 
[vu] € P(V), that there is a sequence nz — oo, such that p(u™*){v] > [v]. On 
the other hand, the preceding paragraph tells us that p(u”*)|v] converges to 


a fixed point of p(w). Therefore, ji-almost every element of P(V) is a fixed 


> (Tv 
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point of p(u). In other words, yz is supported on the set of fixed points of 
p(u), as desired. O 


The assumption that G has no compact factors cannot be omitted from 
Proposition 4.6.3. For example, the usual Lebesgue measure is an SO(n)- 
invariant, finite measure on S”~!, but SO(n) has no fixed points on $”~}. 
We can, however, make the following weaker statement. 


(4.6.4) Corollary. Assume 


e V is a finite-dimensional vector space over R or C, 


e p: G> GL(V) is a continuous homomorphism, and 
e 4 is a p(G)-invariant measure on the projective space P(V). 


If u(P(V)) < oo, then there is a cocompact, closed, normal subgroup G' of G, 
such that y is supported on the set of fixed points of p(G’). 


Proof. Let K be the maximal connected, compact, normal subgroup of G, 
and write G = G’ x K, for some closed, normal subgroup G’ of G. Then G’ 
has no compact factors, so we may apply Proposition 4.6.3 to the restriction 
pla’. O 


It is now easy to prove the other part of Theorem 4.5.1: 


Proof of Theorem 4.5.1(2). By passing to a subgroup of finite index, we 
may assume G is connected. For simplicity, let us also assume that G has no 
compact factors (see Exercise 1). 

Suppose W is a subspace in V that is fixed by p(T), and let d= dimW. 
Note that p induces a continuous homomorphism p: G —> GL(A\“V), and, 
since W is p(T )-invariant, the 1-dimensional subspace ae W is p(T)-invariant. 
Hence, p induces a well-defined map p: G/T —> P(A“ V), with pel) = 
A“ W]. Then, since — is G-equivariant, it pushes the G-invariant, finite 
measure v on G/T to a p(G)-invariant, finite measure 7 on P(A! V). Then 
Proposition 4.6.3 tells us that p(G/T) is contained in the set of fixed points 
of p(G). In particular, [\“ W] = p(eL) is fixed by A(G). This means that W 
is p(G)-invariant. O 


(4.6.5) Remark. The proofs of the two parts of Theorem 4.5.1 never use the 
fact that the lattice I: is discrete. Therefore, I can be replaced with any 
closed subgroup H of G, such that there is a G-invariant, finite measure on 
G/H, and H projects densely into the maximal compact factor of G. 


Exercises for §4.6. 


#1. Complete the proof of Theorem 4.5.1(2), by removing the assumption 
that G has no compact factors. 
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[Hint: See the hint to Exercise 4.5#1.] 


#2. Let H be aclosed subgroup of G that projects densely into the maximal 
compact factor of G. Show that if there is a G-invariant, finite measure 
on G'/H, then the identity component H° is a normal subgroup of G°. 
[Hint: Remark 4.6.5 and the proof of Corollary 4.5.2.] 


§4.7. T is finitely presented 


(4.7.1) Definitions. Let A be a group. 


1) Ais finitely generated it has a finite generating set. That is, there is 
a finite subset of A that is not contained in any proper subgroup of A. 


2) A is finitely presented it has a presentation with only finitely many 
generators and finitely many relations. In other words, there exist: 
e a finitely generated free group F, 
e a surjective homomorphism ¢: F' > A, and 
e a finite subset R of the kernel of ¢, 
such that ker ¢ is the smallest normal subgroup of F' that contains R. 


It is easy to see that every finitely presented group is finitely generated. How- 
ever, the converse is not true. 


In this section, we describe the proof that I is finitely presented. Much 
like the usual proof that the fundamental group of any compact manifold is 
finitely presented, it is based on the existence of a nice set that is close to 
being a fundamental domain for the action of [ on G. 


(4.7.2) Definition. Suppose I acts properly discontinuously on a topological 
space Y. A subset F of Y is a coarse fundamental domain for I if 

1) TF =Y, and 

2) {yET|yFNF FO} is finite. 


(4.7.3) Other terminology. Some authors call F a fundamental set, 
rather than a coarse fundamental domain. 


The following general principle will be used to show that I is finitely 
generated: 


(4.7.4) Proposition. Suppose a discrete group A acts properly discontinuously 
on a topological space Y. If Y is connected, and A has a coarse fundamental 
domain F that is an open subset of Y, then A is finitely generated. 


Proof. Let S = {se A | sF NF £0}. We know that S is finite (see 
Definition 4.7.2(2)), so it suffices to show that S generates A. Here is the 
idea: think of {AF | A € A} as a tiling of Y. The elements of S can move F 
to any adjacent tile, and Y is connected, so a composition of elements of S 
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can move F to any tile. Therefore () is transitive on the set of tiles. Since 
S also contains the entire stabilizer of the tile #, we conclude that (S) = F. 

Now, here is the formal proof. Consider some A € A, such that AFM 
(S)F # @. This means there exists s € (S), such that AF 1 sF 4 Q, so 
s '\F NF #0. Therefore, by the definition of S, we have s-'X € 9, so 
 € 8S C (8). Thus, we have shown that (A \ (S))F is disjoint from (S)F. 

However, both of these sets are open (since F is open), and their union 
is all of Y (since AF = Y). Therefore, since Y is connected, the two sets 
cannot both be nonempty. Since (S) is obviously nonempty, we conclude that 
A~ (S) =6, so (S) =A. oO 


(4.7.5) Corollary. If [\G is compact, then T is finitely generated. 


Proof. Since ['\G is compact, there is a compact subset C of G, such that 
TC = G (see Exercise 4.1411). Let F be a precompact, open subset of G, 
such that C C F. Because C C F, we have TF = G. Also, because F is 
precompact, and [I acts properly discontinuously on G, we know that Con- 
dition 4.7.2(2) holds. Therefore, F is a coarse fundamental domain for I. 
By passing to a subgroup of finite index, we may assume G is connected 
(see Exercise 1), so Proposition 4.7.4 applies. O 


(4.7.6) Example. Let F be the closed unit square in R?, so F is a coarse 
fundamental domain for the usual action of Z? on R? by translations. Define S 
as in the proof of Proposition 4.7.4, so 


s= { (m,n) € Z* | m,n € {-1,0, 1}} = {0, +a1, +a2, +a3}, 


where a; = (1,0), a2 = (0,1), and a3 = (1,1). Then S' generates Z?; in fact, 
the subset {a1, a2} is already a generating set. 

Proposition 4.7.4 does not apply to this situation, because F is not open. 
We could enlarge ¥ slightly, without changing S$. Alternatively, the proposi- 
tion can be proved under the weaker hypothesis that F is in the interior of 
Uses F (see Exercise 5). 

Note that Z? has the presentation 


DP Say; 03 04 | ity — Tsp Bea — ws): 


(More precisely, if F3 is the free group on 3 generators 171, 2,23, then there 
is a surjective homomorphism ¢: F3 + Z?, defined by 


$(x1) = a1, b(x2) = ae, $(x3) = as, 
and the kernel of ¢ is the smallest normal subgroup of F3 that contains both 
112223 - and ©0123 -.) Each of the relations in this presentation is of a very 
simple form, merely stating that the product of two elements of S is equal 
to another element of S. The proof of the following proposition shows that 
relations of this type suffice to define A in a very general situation. 
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(4.7.7) Proposition. Suppose A acts properly discontinuously on a topological 
space Y. If 


e Y is both connected and simply connected, and 


e there is a coarse fundamental domain F for A that is a connected, open 
subset of Y, 


then A is finitely presented. 


Proof. This is somewhat similar to the proof of Proposition 4.7.4, but is more 

elaborate. As before, let S={XE€A|AF AF AO}. For each s € S, define 

a formal symbol z,, and let F' be the free group on {x,}. Finally, let 
R={2,%:2,,' | s,t, st € S}, 

so R is a finite subset of F’. 

We have a homomorphism ¢: F' > A determined by ¢(#,) = s. From the 
proof of Proposition 4.7.4, we know that ¢ is surjective, and it is clear that 
R Ckerd¢. The main part of the proof is to show that ker ¢ is the smallest 
normal subgroup of F' that contains R. (Since R is finite, and F'/ker¢ & A, 
this implies that A is finitely presented, as desired.) 

Let N be the smallest normal subgroup of F' that contains R. (It is clear 
that N C ker(¢); we wish to show ker(¢) C N.) 

e Define an equivalence relation ~ on (F'/N) x ¥, by stipulating that 

(fN,y) ~ (f'N, y’) if and only if there exists s € S$, such that 7, fN = 
f'N and sy = y’ (see Exercise 6). 


e Let Y be the quotient space ((F/N) x F)/~. 


e Define a map w: (F/N) x F —Y by o(fN,y) = o(f~')y. (Note that, 
because N C ker(@), the map w is well defined.) 


Because 


w(asfN, sy) = (¢(f-*)s*) (sy) = ¥(FN, y), 


we see that ~ factors through to a well-defined map w LY ae 

Let F be the image of (ker(¢)/N) x F in Y. Then it is obvious, from 
the definition of ~, that W(F ) = F. In fact, it is not difficult to see that 
w!(F) =F (see Exercise 7). 

For each f € F’, the image Fy of (fN) x F in Y is open (see Exercise 8), 
and, for fi, fo € ker(@), one can show that Fy, 1 Fy, = if fi # fo (mod N) 
(cf. Exercise 9). Therefore, from the preceding paragraph, we see that w isa 
covering map over F. Since Y is covered by translates of F (and F is open) 
it follows that w is a covering map. 

Because ¥ is connected, it is not difficult to see that Y is connected 
(see Exercise 10). Since Y is simply connected, and w is a covering map, this 
implies that w is a homeomorphism. Hence, w is injective, and it is easy to 
see that this implies ker(¢) = N, as desired. 
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(4.7.8) Remark. The assumption that F is connected can be replaced with 
the assumption that Y is locally connected. However, the proof is somewhat 
more complicated in this setting. 


(4.7.9) Corollary. If ['\G is compact, then T is finitely presented. 


Proof. Let kK be a maximal compact subgroup of G, so I acts properly 
discontinuously on G/K. Arguing as in the proof of Corollary 4.7.5, we see 
that T has a coarse fundamental domain F that is an open subset of G/K. 
From the “Iwasawa decomposition” G = KAN (see Theorem 8.4.9), we see 
that G/K is connected and simply connected (see Exercise 8.4#11(b)). So 
Proposition 4.7.7 implies that T is finitely presented. L 


If [\G is not compact, then it is more difficult to prove that IT is finitely 
presented (or even finitely generated). 


(4.7.10) Theorem. [I is finitely presented. 


Idea of proof. It suffices to find a coarse fundamental domain for [ that is 
a connected, open subset of G/K. Assume, without loss of generality, that [ 
is irreducible. 

In each of the following two cases, a coarse fundamental domain F can 
be constructed as the union of finitely many translates of “Siegel sets’ (This 
will be discussed in Chapters 7 and 19.) 


1) Tis “arithmetic? as defined in (5.1.19), or 


2) G has a simple factor of real rank one, or, more generally, we have 
rankgI < 1 (see Definitions 8.1.6 and 9.1.4). 


The (amazing!) Margulis Arithmeticity Theorem (5.2.1) implies that these 
two cases are exhaustive, which completes the proof. L] 


(4.7.11) Remark. It is not necessary to appeal to the Margulis Arithmeticity 
Theorem in order to prove only that T is finitely generated (and not that it is 
finitely presented). Namely, if (2) does not apply, then the real rank of every 
simple factor of G is at least two, so Kazhdan’s Property (T) implies that [ 
is finitely generated (see Proposition 13.1.7(3)). 


(4.7.12) Remark. For n > 2, T is said to be of type F,, if there is a compact 
CW complex X, such that: 

e the fundamental group 7,(X) is isomorphic to I, and 

e the homotopy group 7,(X) is trivial for 2<k <n. 
Since I is finitely presented (see Theorem 4.7.10), it is easy to see that [ 
is of type Fy. (In fact, a group is of type F> if and only if it is finitely 
presented.) Borel and Serre proved the much stronger result that TI is of 
type F,, for every n. (If G/T is compact, and [I is torsion free, then one 
may let X = [\G/K, where K is a maximal compact subgroup of G. (In 
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other words, X is the locally symmetric space associated to T.) When G/T 
is not compact, but I is torsion free, then X is a certain space called the 
“Borel-Serre compactification” of [\G/K.) 


Exercises for §4.7. 


ra 


#2. 


#3. 


4A, 


#5. 


#6. 


Show that if some finite-index subgroup of A is finitely generated, then 
A is finitely generated. 


[Hint: If F is a finite set of coset representatives for (S) in A, then SUF 
generates A.] 


Assume A is abstractly commensurable to A’. Show A is finitely gener- 
ated if and only if A’ is finitely generated. 


[Hint: Exercise 1 is half of the proof. For the other half, suppose F' is a 
finite set of coset representatives for the subgroup A’ of A, and S is a finite 
generating set for A. For each f € F and s € S, we have fs € A’ f’, for some 
fy, € F. Then { fs(f;.)' | f € Fs © S} generates A’. Alternatively, it 
is easy to prove this topologically: If A is the fundamental group of a CW- 
complex » with only finitely many 1-cells, then A’ is the fundamental group 
of a finite cover of ©, which must also have only finitely many 1-cells.| 


Assume A is abstractly commensurable to A’. Show A is finitely pre- 
sented if and only if A’ is finitely presented. 


[Hint: Suppose A’ has finite index in A, and F is a set of coset represen- 
tatives. Let S and S” be finite generating sets of A and A’, respectively 
(see Exercise 2). 

(<) For each s € SUF and f € F, there exist g € A’ and f’ € F, 
such that fs = gf’. Adding these relations to a presentation of A’ yields a 
presentation of A. 

(=) Proving this direction algebraically is somewhat more complicated, 
but there is an easy topological proof: If A is the fundamental group of a 
CW-complex © whose 2-skeleton is finite, then A’ is the fundamental group 
of a finite cover of &, which must also have only finitely many 1-cells and 
2-cells. | 


Suppose A is a discrete subgroup of a locally compact group H. Show 
that if H/A is compact, and H is compactly generated (that is, there 
is a compact subset C of H, such that (C’) = H), then A is finitely 
generated. (This provides an alternate proof of Proposition 4.7.4 that 
does not require G to be connected.) 

[Hint: Assume e € C. Choose a compact subset F of H, such that FA = H 
(and e € F), and let S=AN(F-'C*'F). If \ = c1ca-+-cn with c € C*’, 
then \= 1°: An, with A; € S)] 


Prove Proposition 4.7.4, replacing the assumption that F is open with 
the weaker assumption that F is in the interior of U,egsF (where S 
is as defined in the proof of Proposition 4.7.4). 


Show that the relation ~ defined in the proof of Proposition 4.7.7 is an 
equivalence relation. 
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#7. In the notation of the proof of Proposition 4.7.7, show that if Y(fN, y) € 
F, then (fN,y) ~ (f’N,y’), for some f’ € ker(¢) and some y’ € F. 
[Hint: We have ¢(f) € S, because ¢(f~')y € F]] 


#8. In the notation of the proof of Proposition 4.7.7, show that the inverse 
image of Fy in (F/N) x F is 


UJ ((csfN/N) x (F 1 8F)), 
ses 
which is open. 


#9. In the notation of the proof of Proposition 4.7.7, show that if we have 
FN Fe #0 and f € ker(¢), then f € N. 
[Hint: If (fN,yi) ~ (N, yz), then there is some s € S with z,N = fN. Since 
f € ker(¢), we have s = ¢(x;) = o(f) =e. 


#10. Show that the set Y defined in the proof of Proposition 4.7.7 is con- 
nected. 
[Hint: For s1,...,8r € S, define F; = {rs,---%s,N} x F. Show there exist 
a € F; and b € Fj+1, such that a ~ 6.] 


#11. Assume A acts properly discontinuously on a topological space Y. Show 
that a Borel subset F of Y is a coarse fundamental domain for A if and 
only if 

a) F contains a strict fundamental domain Fo for A, and 
b) there is a finite subset F’ of A, such that F C FFo. 


§4.8. [T has a torsion-free subgroup of finite index 


(4.8.1) Definition. A group is torsion free if it has no nontrivial finite 
subgroups. Equivalently, the identity element e is the only element of finite 
order. 


(4.8.2) Theorem (Selberg’s Lemma). IT has a torsion-free subgroup of finite 
index. 


Proof. From the Standing Assumptions (4.0.0), we know I C SL(Z,R), for 
some ¢. Let us start with an illustrative special case. 


Case 1. Assume T = SL(@,Z). For any positive integer n, the natural ring 
homomorphism Z —+ Z/nZ induces a group homomorphism T - SL(¢, Z/nZ) 
(see Exercise 2); let [',, be the kernel of this homomorphism. (This is called 
the principal congruence subgroup of SL(¢,Z) of level n.) Since it is the 
kernel of a group homomorphism, we know that I, is a normal subgroup of I. 
It is also not difficult to see that [,, has finite index in I (see Exercise 3). 
It therefore suffices to show that [, is torsion free, for some n. In fact, TP, 
is torsion free whenever n > 3 (see Exercise 5), but, for simplicity, we will 
assume n = p is an odd prime. 
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Given y €T, \ {Id} and k € NX {0}, we wish to show that 7* 4 Id. We 
may write 
y =Id+p"T, 
where 
ed>l, 
e T € Maty,¢(Z), and 
e p{T (that is, not every matrix entry of T is divisible by p). 


Also, we may assume k is prime (see Exercise 4). Therefore, either p { k or 
p= hk: 
Subcase 1.1. Assume p{k. Noting that 
(p°T)? = pT? = 0 (mod p***), 
and using the Binomial Theorem, we see that 
yk = (Id+p*T)* = Id+k(p?T) # Id (mod p***), 
as desired. 


Subcase 1.2. Assume p = k. Using the Binomial Theorem (and noting that 
(”) p™ is divisible by p?*? for i > 1 (see Exercise 1)), we have 


yk = 4? = (Id+p?T)? = Id+p(p"T) = Id+p?*1T F Id (mod p“*?). 


Case 2. Assume T C SL(@,Z). From Case 1, we know there is a torsion-free, 
finite-index subgroup [,, of SL(¢,Z). Then TNT, is a torsion-free subgroup 
of finite index in I. 


Case 3. The general case. The proof is very similar to Case 1, with the ad- 
dition of some commutative algebra (or algebraic number theory) to account 
for the more general setting. 

We know that [ is finitely generated (see Theorem 4.7.10), so there ex- 
ist a1,...,a, € C, such that every matrix entry of every element of I is 
contained in the ring Z = Zla;,...,a,] generated by {a1,...,a,} (see Exer- 
cise 7). Therefore, letting A = SL(¢, Z), we have IC A. 

Now let p be a maximal ideal in Z. Then Z/p is a field, so, because 
Z/p is also known to be a finitely generated ring, it must be a finite field. 
Therefore, the kernel of the natural homomorphism A —> SL(é, Z/p) has finite 
index in A. Basic facts of Algebraic Number Theory allow us to work with 
the prime ideal p in very much the same way as we used the prime number p 
in Case 1. 


(4.8.3) Warning. Our standing assumption that G C SL(¢,R) is needed 
for Theorem 4.8.2. For example, the group Sp(4,R) has an 8-fold cover, 
which we call H. The inverse image of Sp(4, Z) in H is a lattice A in H. It 
can be shown that every finite-index subgroup of A contains an element of 
order 2, so no subgroup of finite index is torsion free. This does not contradict 
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Theorem 4.8.2, because H is not linear: it has no faithful embedding in any 
SL(é,R). 


If y* = Id, then every eigenvalue of y must be a k'® root of unity. If, in 
addition, y # Id, then at least one of these roots of unity must be nontrivial. 
Therefore, the following is a strengthening of Theorem 4.8.2. 


(4.8.4) Theorem. There is a finite-index subgroup I’ of T, such that no 
eigenvalue of any element of I’ is a nontrivial root of unity. 


Proof. Assume [ = SL(@, Z). Let 


e n be some (large) natural number, 


I’, be the principal congruence subgroup of T° of level n, 
e w be a nontrivial k® root of unity, for some k, 

e 7 be an element of [’,, such that w is an eigenvalue of 7, 
T =y- Id, 

Q(x) be the characteristic polynomial of T, and 


e \=w—1, so 4 is a nonzero eigenvalue of T’. 


Since y €T,, we know that n|T, so Q(x) = x +nR(z), for some integral 
polynomial R(x). Since Q(A) = 0, we conclude that A° = n¢, for some 
¢ € ZA]. Therefore, \’ is divisible by n, in the ring of algebraic integers. 

The proof can be completed by noting that any particular nonzero alge- 
braic integer is divisible by only finitely many natural numbers, and there are 
only finitely many roots of unity that satisfy a monic integral polynomial of 
degree ¢. See Exercise 8 for a slightly different argument. O 


(4.8.5) Remarks. 


1) The proof of Theorem 4.8.2 shows that [I has nontrivial, proper, nor- 
mal subgroups, so [ is not simple. However, the normal subgroups 
constructed there all have finite index. In fact, it is often the case 
that every nontrivial, normal subgroup of T has finite index (see The- 
orem 17.1.1). 

Moreover, although it will not be proved in this book, it is often the 
case that all of the normal subgroups of finite index are close to being 
of the type constructed in the course of the proof. More precisely, the 
“Congruence Subgroup Property” asserts there is a constant C’, such 
that if N is any finite-index, normal subgroup of I, then there is a 
principal congruence subgroup I” of I, such that |[’ : N’| < C. This is 
not always true, but it has been proved for the lattices in many groups. 


2) Arguing more carefully, one can obtain a finite-index subgroup I” with 
the stronger property that, for every 7 € I’, the multiplicative group 
generated by the (complex) eigenvalues of y does not contain any non- 
trivial roots of unity. Such a subgroup is sometimes called net. 
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3) If F is any field of characteristic zero, then Theorem 4.8.2 remains valid 
(with the same proof) when I is replaced with any finitely generated 
subgroup A of SL(¢, F’). 


Let us now present an alternate approach to the general case of Theo- 
rem 4.8.2. It requires only the Nullstellensatz, not Algebraic Number Theory. 


Another proof of Theorem 4.8.2 (optional). Let 


e Z be the subring of C generated by the matrix entries of the elements 
of I’, and 


e F' be the quotient field of Z. 


Because I is a finitely generated group (see Theorem 4.7.10), we know that 
Z is a finitely generated ring (see Exercise 7), so F' is a finitely generated 
extension of Q. 


Step 1. We may assume that F = Q(xj,...,2,) is a purely transcendental 
extension of Q. Choose a subfield L = Q(a,,...,2,) of F', such that 


e Lisa purely transcendental extension of Q, and 
e F is an algebraic extension of L. 


Let d be the degree of F' over L. Because F is finitely generated (and algebraic 
over L), we know that d < oo. Therefore, we may identify F’ with L“, so 
there is an embedding 
T CSL(¢, F) — SL(dé, L). 

Hence, by replacing F’' with L (and replacing @ with dé), we may assume that 
F is purely transcendental. (Identifying F’ with L@ is the foundation of an 
important technique called “Restriction of Scalars” that will be introduced in 
Section 5.5.) 


Step 2. If y is any element of finite order in SL(Z, F’), then trace(y) € Z, and 
| trace(y)| < ¢. There is a positive integer k with y* = Id, so every eigenvalue 
of y is a k*® root of unity. The trace of 7 is the sum of these eigenvalues, and 
any root of unity is an algebraic integer, so we conclude that the trace of ¥ is 
an algebraic integer. 

Since trace(y) is the sum of the diagonal entries of 7, we know trace(y) € 
F. Since trace(y) is algebraic, but F' is a purely transcendental extension 
of Q, this implies trace(y) € Q. Since trace(y) is an algebraic integer, this 
implies trace(y) € Z. 

Since trace(y) is the sum of @ roots of unity, and every root of unity is on 
the unit circle, we see, from the triangle inequality, that | trace(y)| < @. 


Step 3. There is a prime number p > 20, such that 1/p ¢ Z. From the 
Nullstellensatz (B4.5), we know that there is a nontrivial homomorphism 
o: Z > Q, where Q is the algebraic closure of Q in C. Replacing Z with 
o(Z), let us assume that Z C Q. Thus, for each z € Z, there is some nonzero 
integer n, such that nz is an algebraic integer. More precisely, because Z is 
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finitely generated, there is an integer n, such that, for each z € Z, there is 
some positive integer k, such that n*z is an algebraic integer. It suffices to 
choose p so that it is not a divisor of n. 


Step 4. There is a finite field E of characteristic p, and a nontrivial homo- 
morphism dp: Z —+ E. Because 1/p ¢ Z, there is a maximal ideal p of Z, 
such that p € p. Then EF = Z/p is a field of characteristic p. Because it is a 
finitely generated ring, E must be a finite extension of the prime field Z/pZ 
(see Theorem B4.3), so F is finite. 


Step 5. Let A be the kernel of the homomorphism bp: SL(é,Z) > SL(Z, EF) 
that 1s induced by @p. Then A is torsion free. Let y be an element of finite 
order in A. Then 


trace(}p(7)) = trace(Id) = @ (mod p), 


so p | (é— trace(y)) (since Step 2 tells us that trace(y) € Z). Since we also 
know that |trace(y)| < @ and p > 2é, we conclude that trace(y) = @. Since 
the ¢ eigenvalues of y are roots of unity, and trace(y) is the sum of these 
eigenvalues, we conclude that 1 is the only eigenvalue of 7. Since y* = Id, we 
know that + is elliptic (hence, diagonalizable over C), so this implies y = Id, 
as desired. O 


Exercises for §4.8. 


##1. Show that if p is an odd prime, d > 1, and 2 <i < p, then (®)p® is 
divisible by p?*?. 


(Hint: If either d > 1 or i > 2, then di > d42.] 


#2. Show that SL(Z,-) is a (covariant) functor from the category of rings 
with identity to the category of groups. That is, show: 
a) if A is any ring with identity, then SL(¢, A) is a group, 
b) for every ring homomorphism ¢: A > B (with ¢(1) = 1), there is 
a group homomorphism ¢,: SL(¢,A) > SL(é, B), and 
c) if¢@: A> Bandy: B —- Care ring homomorphisms (with ¢(1) = 
1 and y(1) = 1), then (wo d), = Wx 0 dy. 


#3. Show that if B is a finite ring with identity, then SL(Z, B) is finite. Use 
this fact to show, for every positive integer n, that if I’, denotes the 
principal congruence subgroup of SL(¢,Z) of level n (cf. Case 1 of the 
proof of Theorem 4.8.2), then T,, has finite index in SL(@, Z). 


#4. Show that if [ has a nontrivial element of finite order, then I has an 
element of prime order. 


#5. Show the principal congruence subgroup I’, is torsion free if n > 3. 


[Hint: Since Ty, C Tn whenever n | m, you may assume, without loss of 
generality, that n is either 4 or an odd prime.| 
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#6. In the notation of Case 1 of the proof of Theorem 4.8.2, show that Iz 
is not torsion free. Where does your solution of Exercise 5 fail? 


#7. Show that if A is a finitely generated subgroup of SL(¢,C), then there 
is a finitely generated subring B of C, such that A Cc SL(¢, B). 


[Hint: Let B be the subring of C generated by the matrix entries of the 
generators of A.| 


#8. Suppose w is a nontrivial root of unity, and (w — 1) = n¢, for some 
mleZ and CeEZ+ Za +-:°+ Ze, Shown < 26)", 
[Hint: Let F' be the Galois closure of the field extension Q(w) of Q generated 
by w, and define N: F + Q by N(2) = [Lo ccarjo) 7(2)- Then N(w — ies 
n“N(C), and |N(w — 1)| < 2¢ < 2", where d is the degree of F over Q.] 


#9. Show that I is residually finite. That is, for every y € [\ {e}, show 
that there is a finite-index, normal subgroup I” of TP, such that y ¢ I’. 
(In particular, if I’ is infinite, then [ is not a simple group.) 


#10. Show there is a sequence N,, No,... of subgroups of I, such that 


a) Ni, D> NoD-:: 4 
b) each N; is a finite-index, normal subgroup of I, and 
c) Ni N Non --: = fe}. 


[Hint: Use Exercise 9.] 


#11. Show that SL(n, Q) commensurates SL(n, Z). 


[Hint: For each g € SL(n, Q), there is a principal congruence subgroup Im 
of SL(n, Z), such that g~'T'mg C SL(n, Z).] 


#12. Show that if y: Pf + GL(n,R) is a homomorphism, such that every 
element of y(T) has finite order, then y(L) is finite. 
[Hint: Remark 4.8.5(3).] 


§4.9. [ has a nonabelian free subgroup 


In this section, we describe the main ideas in the proof of the following im- 
portant result. 


(4.9.1) Theorem (Tits Alternative). Jf A is a subgroup of SL(é,R), then 
either 


1) A contains a nonabelian free group, or 
2) A has a solvable subgroup of finite index. 


Since [is not solvable when G is not compact (see Exercise 4.5746), the 
following is an immediate corollary. 


(4.9.2) Corollary. If G is not compact, then TI contains a nonabelian free 
group. 
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(4.9.3) Definition. Let us say that a homeomorphism ¢ of a topological space 
M is (A-,B, At)-contracting if A~, B and At are nonempty, disjoint, 
open subsets of M, such that ¢(B U At) C At and @ 1(BUA7~)CA™. 


In a typical example, A~ and A® are small neighborhoods of points p~ 
and p*, such that ¢ collapses a large open subset of M into At, and ¢7! 
collapses a large open subset of M into A~ (see Figure 4.9A). 


B B 


B B 


Figure 4.9A. A typical (A~,B,A*)-contracting homeo- 
morphism of the circle. 


(4.9.4) Example. Let 
e M be the real projective line P(R7), 
y= F nA € SL(2,R), 
e A” be any (small) neighborhood of p~ = (0: 1] in P(R?), 
e A* be any (small) neighborhood of p* = [1 : 0] in P(R?), and 
e B be any precompact, open subset of P(R?) \ {p~,p*}. 
For any (x,y) € R? with x 4 0, we have 
ya: y] = [2"@ : 2°"y] = [122° 7" y/a] > [1:0] = pt as n> co, 
and the convergence is uniform on compact subsets. Similarly, we have y~ "|x : 


y] ~ p- asn-— co. Hence, for sufficiently large n, the homeomorphism ” is 
(A~, B, At)-contracting on P(R?). 


More generally, if y is any nontrivial, hyperbolic element of SL(2, R), then 
y" is (A~, B, A*)-contracting on P(R?), for some appropriate choice of A~, 
B, and A* (see Exercise 1). 

The following is easy to prove by induction on n. 


(4.9.5) Lemma. If ¢ is (A~, B, At)-contracting, then 
1) 6"(B) C At for all n > 0, 
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2) 6"(B) CA™ for alin <0, 
3) ¢"(B) C A~ UAT for alln £0. 


The following lemma is the key to the proof of Theorem 4.9.1. 
(4.9.6) Lemma (Ping-Pong Lemma). Suppose 


e ¢ andw are homeomorphisms of a topological space M, 
e A, At, B-, and B* are nonempty, pairwise-disjoint, open subsets 
of M, 
e ¢ is (A_, B, At)-contracting, where B= B~ U Bt, and 
e w is (B_,A, B*)-contracting, where A= A~ UAT. 
Then @ and w have no nontrivial relations; so (@,w) is free. 
Proof. Consider a word of the form w = yw"... ¢™* wy", with each m; 


and n; nonzero. We wish to show w # e. 
From Lemma 4.9.5(3), we have 


e™i(B) CA and wri(A) CB, 06) ey aa eae 


Therefore 


aN 


(A) 
me ph® (A) 
ee nee (A) 

Qe ho eas (A ) 
and so on: points bounce back and forth between A and B. (Hence, the name 
of the lemma.) In the end, we see that w(A) C A. 

Assume, for definiteness, that m; > 0. Then, by applying 4.9.5(1) in 
the last step, instead of 4.9.5(3), we obtain the more precise conclusion that 
w(A) C At. Since A Z A® (recall that A7 is disjoint from A‘), we conclude 
that w # e, as desired. 


(4.9.7) Corollary. If 7, and y2 are two nontrivial hyperbolic elements of 
SL(2, R) that have no common eigenvector, then, for sufficiently large n € ZT, 


the group ((71)", (y2)") is free. 
Proof. Let 
e vu; and w; be linearly independent eigenvectors of 7;, with eigenvalues 
A; and 1/A,, such that A; > 1, 
e A* and A™~ be small neighborhoods of [v;] and [wi] in P(R?), and 
e B* and B~ be small neighborhoods of [vg] and [we] in P(R?). 


By the same argument as in Example 4.9.4, we see that if n is sufficiently 
large, then 


e (y1)" is (A~, B” UB*, At)-contracting, and 
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e (y2)" is (B-, A~ U At, B*)-contracting 


(see Exercise 1). Therefore, the Ping-Pong Lemma (4.9.6) implies that 
((y1)”, (y2)") is free. ie 


We can now give a direct proof of Corollary 4.9.2, in the special case 
where G = SL(2, R). 


(4.9.8) Corollary. If G = SL(2,R), then T contains a nonabelian, free group. 


Proof. By passing to a subgroup of finite index, we may assume that I is 
torsion free (see Theorem 4.8.2). Hence, I has no elliptic elements. Not every 
element of [ is unipotent (see Exercise 4.5410), so we conclude that some 
nontrivial element 7, of I is hyperbolic. 

Let v and w be linearly independent eigenvectors of 71. The Borel Density 
Theorem (4.5.6) implies that there is some y € I’, such that {yv, yw}N (Rv U 
Rw) = @ (see Exercise 2). Let y2 = yy1y~!, so Y2 is a hyperbolic element 
of I with eigenvectors yu and yw. 

From Corollary 4.9.7, we conclude that ((71)”, (y2)”) is a nonabelian, free 
subgroup of I’, for some n € Z*. O 


The same ideas work in general: 


Idea of direct proof of Corollary 4.9.2. Assume G C SL(é,R). Choose 
some nontrivial, hyperbolic element y; of T, and let Ay > Ag > +--+ > Ag be 
its eigenvalues. We may assume, without loss of generality, that A; > Ae. 
(If the eigenvalue \; has multiplicity d, then we may pass to the d‘® exterior 
power A4(R°), to obtain a representation in which the largest eigenvalue of 7; 
is simple.) 

Let us assume that the smallest eigenvalue Ag is also simple; that is, 
Ae < Ae-1. (One can show that this is a generic condition in G, so it can be 
achieved by replacing 7, with some other element of I.) 

Let v be an eigenvector corresponding to the eigenvalue A; of 71, and let 
w be an eigenvector for the eigenvalue Ay. Assume, to simplify the notation, 
that all of the eigenspaces of 7; are orthogonal to each other. Then, for any 
x € R’\ vt, we have (y1)"[z] > [v] in P(R®), as n — oo (see Exercise 3). 
Similarly, if x ¢ wt, then (y1)~"[z] > [w]. 

We may assume, by replacing R‘ with a minimal G-invariant subspace, 
that R* has no nontrivial, proper, G-invariant subspaces. Then the Borel 
Density Theorem implies that there exists 7 € I, such that we have {yv, yw}N 
(Rv U Rw) = 0. 

Then, for any small neighborhoods A~, At, B~, and B* of [v}, [w], [ye], 
and [yw], and any sufficiently large n, the Ping-Pong Lemma implies that the 
subgroup ((71)”, (yy17~+)”) is free. O 
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(4.9.9) Remark. The proof of Theorem 4.9.1 is similar, but involves additional 
complications. 


1) 


In order to replace R‘ with an irreducible subspace W, it is necessary 
to have dimW > 1 (otherwise, there do not exist two linearly indepen- 
dent eigenvectors v and w). Unfortunately, the minimal A-invariant 
subspaces may be 1-dimensional. After modding these out, the mini- 
mal subspaces in the quotient may also be 1-dimensional, and so on. 
In this case, the group A consists entirely of upper-triangular matrices 
(after a change of basis), so A is solvable. 


The subgroup A may not have any hyperbolic elements. Even worse, 
it may be the case that 1 is the absolute value of every eigenvalue 
of every element of A. (For example, A may be a subgroup of the 
compact group SO(n), so that every element of A is elliptic.) In this 
case, the proof replaces the usual absolute value with an appropriate 
p-adic norm. Not all eigenvalues are roots of unity (cf. Theorem 4.8.4), 
so Algebraic Number Theory tells us that some element of A has an 
eigenvalue whose p-adic norm is greater than 1. The proof is completed 
by using this eigenvalue, and the corresponding eigenvector, just as we 
used A; and the corresponding eigenvector v. 


Exercises for §4.9. 


#1. 


HD 


#3. 


In the notation of the proof of Corollary 4.9.7, show that if A~, AT, 
B~-, and B™ are disjoint, then, for all large n, the homeomorphism 
(y1)” is (A~, B~ U Bt, At)-contracting on P(R?). 


Assume that G is irreducible in SL(@, R) (see Definition A7.3), and that 
T projects densely into the maximal compact factor of G. If F is a finite 
subset of R’\ {0}, and W is a finite set of proper subspaces of R’, show 
that there exists y € I, such that 


VFO tg W=9. 
wew 


[Hint: For v € F and W € W, the set Au.w ={g €G| gv € W} is Zariski 
closed, so U..w Av,w is Zariski closed. Apply the Borel Density Theorem 
and Exercise A4#7.| 


Let 
e » bea hyperbolic element of SL(¢, R), 
© \y >A. >-::: > re be the eigenvalues of ¥, 
e v be an eigenvector of y corresponding to the eigenvalue A,, and 
e W be the sum of the other eigenspaces. 
Show that if x € P(R*) \ [W], then y"x — [v] as n — oo. Furthermore, 
the convergence is uniform on compact subsets. 
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#4. (another version of the Ping-Pong Lemma) Suppose A and B are dis- 
joint, nonempty subsets of M, such that ¢"(B) C A and w"(A) C B, 
for every nonzero integer n. Show (¢, w) is free. 

[Hint: If every m; and n,; is nonzero, then d™!w"!---P™* wrk h™*+1(B) C 
Al 

#5. Let I’ = <r ‘| ; E a) be the Sanov subgroup of SL(2,Z). Show 

that I’ is a free subgroup of finite index in SL(2, Z). 


4k+1 2¢ 
2m 


diy 1| are 


[Hint: Exercise 4 implies I’ is free. Matrices of the form 
in T’] 

#6. Generalizing Exercise 5, show that every torsion-free subgroup of SL(2, Z) 
is a free group. 
[Hint: Let OF be the boundary of the usual fundamental domain for the 
action of SL(2, Z) on the upper half plane $9)” (see Figure 1.3A). Then ven’ 
OF is a contractible 1-dimensional simplicial complex; in other words, it is a 
tree. I acts properly on this tree, so any subgroup of [' that acts freely must 
be a free group.| 


#7. Show there is an irreducible lattice [ in SL(2,R) x SO(3), such that 
[TN SL(2, R) is infinite. 


[Hint: There is a free group F' and a homomorphism ¢: F' — SO(3), such 
that ¢(F') is dense in SO(3).] 


§4.10. Moore Ergodicity Theorem 


All mathematicians encounter situations in which they would like to prove 
that some function y on some space X is constant. If X = G/T, this means 
that they would like to prove y is G-invariant. 


(4.10.1) Definition. Suppose f is a function on G/T, and H is a subgroup 
of G. We say that f is H-invariant if f(hx) = f(x) for all h © H and 
xeéeG/T. 


The following fundamental result shows that it suffices to prove ¢ is invari- 
ant under a much smaller subgroup of G (if we make the very weak assumption 
that y is measurable). It does not suffice to prove that vy is invariant under a 
compact subgroup, because it is easy to find a non-constant, continuous func- 
tion on G/T that is invariant under any given compact subgroup of G (unless 
G itself is compact) (see Exercise 1), so the following result is optimal. 
(4.10.2) Theorem (Moore Ergodicity Theorem). Suppose 

e G is connected and simple, 

e H is a closed, noncompact subgroup of G, and 

e vo: G/T > C is H-invariant and measurable. 


Then vy is constant (a.e.). 
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Exercise 3 shows how to derive this theorem from the following more 
general result that replaces [ with a discrete subgroup that need not be a 
lattice. (This generalization will be a crucial ingredient in Section 7.4’s proof 
of the important fact that SL(n,Z) is a lattice in SL(n,R).) In this more 
general situation, we impose an £L” integrability hypothesis on y, in order to 
compensate for the fact that G/A is not assumed to have finite measure (cf. 
Exercise 2). 


(4.10.3) Theorem. Suppose 

e G is connected and simple, 

e H is a closed, noncompact subgroup of G, 

e A is a discrete subgroup of G, and 

e ¢ is an H-invariant £?-function on G/A (with 1 <p < ov). 
Then ~ is constant (a.e.). 


Idea of proof. To illustrate the key ingredient in the proof, let us consider 
only the special case where G = SL(2,R) and A is the group of diagonal 
matrices. (A proof of the general case will be given in Section 11.2.) Let 


b= le* 0 10 
at = | olen and we [19]. 


0 
Note that straightforward matrix multiplication (see Exercise 4) shows 
lim atua~* =e. (4.10.4) 
t+oo 


For g € G, define gp: G/A > C by gy(x) = v(g7 tx). We are assuming 
that vy is a‘-invariant (which means a‘y = ), and the crux of the proof is the 
observation that we can use (4.10.4) to show that y» must also be u-invariant: 
we have 

lupe — Yllp = llarup — a*yllp (Exercise 10) 
= ||(a’ua~*)aty — a’ yllp (inserting a~‘a‘) 


i =i raed 


— t,,,—t = 
= |[(a'ua™) 9 — ¥llp y is H-invariant 


(4.10.4) a) 


> |lep—-gllp ast—oo Gee 11 
= 0, 


so up =  (a.e.). 
Thus, from the fact that y is H-invariant, we have shown that 
y must also be [} 9]-invariant (a.e.). 
The same calculation, but with t — —oo, shows that 


y must also be [4 7]-invariant (a.e.). 
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Since E ‘| and F i generate SL(2,R) = G (see Exercise 6), we conclude 


0 


that y is G-invariant (a.e.). Since G is transitive on G/A, this implies that » 
is constant (a.e.) (see Exercise 7). O 


Exercises for §4.10. 


#1. 


4.2. 


#3. 


4A, 
45. 


#16. 


#7. 


#8. 


#9. 


410. 


Show that if K is any compact subgroup of G, and G is not compact, 
then there is a continuous, K-invariant function on G/T that is not 
constant. 


Show there is a counterexample to Theorem 4.10.3 if we remove the 
assumption that the measurable function ¢ is L£?. 

[Hint: It is easy to construct an counterexample by taking A to be trivial (or 
finite).] 

Derive Theorem 4.10.2 from Theorem 4.10.3. 

[Hint: If there is a nonconstant H-invariant function on G/T, then there is 
one that is bounded.| 


Verify Equation (4.10.4). 


Suppose 

e G and H are as in the Moore Ergodicity Theorem (4.10.2), and 

e X is an H-invariant, measurable subset of G/T. 
Show that either X has measure 0, or the complement of X has mea- 
sure 0). 


Show SL(2,R) is generated by the subgroups { E 9] \ and { F | \. 


(Hint: If a matrix can be reduced to the identity matrix by a sequence of 
elementary row operations, then it is a product of elementary matrices. 


Suppose y is a measurable function on G/A, and for each g € G, we 
have y(gx) = v(x) for a.e. x € G/A. Show y is constant (a.e.). 
[Hint: Use Fubini’s Theorem to reverse the quantifiers. ] 


Suppose G and H are as in the Moore Ergodicity Theorem (4.10.2). 
Show that Hz is dense in G/T, for ae. x € G/T. 


[Hint: Use Exercise 5. For any open subset O of G/T, the set HO is mea- 
surable (why?) and H-invariant.| 


Assume G is simple, and let H be a subgroup of G (not necessarily 
closed). Show that every (real-valued) H-invariant measurable function 
on G/T is constant (a.e.) if and only if the closure of H is not compact. 


Show that if A is a discrete subgroup of G, and py is a G-invariant mea- 
sure on G/A, then ie gy dp = ee y du, for every g € G and measurable 
yp: G/A>C. 

[Hint: Apply a change of variables, and use the fact that gj = pw (since p is 
G-invariant).| 
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#11. Show that G acts continuously on £°G/A) if 1 < p < oo. More 
precisely, show that if A is a discrete subgroup of G, and we define 
a: Gx L°G/A) > L°G/A) by a(g, vy) = gy, then a is continuous. 
[Hint: To show a is continuous in g, use Lusin’s Theorem (B6.6) to approx- 


imate y by a uniformly continuous function. Then use Exercise 10 (and the 
Triangle Inequality) to complete the proof.] 


Notes 


Raghunathan’s book [16] is the standard reference for the basic proper- 
ties of lattices. It contains almost all of the material in this chapter, except 
the Tits Alternative (Section 4.9) and the Moore Ergodicity Theorem (Sec- 
tion 4.10). 

Remark 4.4.5 (the existence of unipotent elements in noncompact lattices) 
was proved by Kazhdan and Margulis [13]. Expositions can be found in [3] 
and [16, Cor. 11.13, p. 180]. 

The Borel Density Theorem (4.5.1) was proved by Borel [2]. It appears 
in (14, Thm. 2.4.4, p. 93], [16, Thm. 5.5, p. 79], and (21, Thm. 3.2.5, pp. 41- 
42]. Several authors have published generalizations or alternative proofs (for 
example, [6, 9, 20]). 

Our presentation of Propositions 4.7.4 and 4.7.7 is based on [15, pp. 195— 
199]. A proof of Remark 4.7.8 can also be found there. A proof of The- 
orem 4.7.10 for the case where [I is arithmetic can be found in [4] or [15, 
Thm. 4.2, p. 195]. For the case where rankg I = 1, see [10] or [16, Cor. 13.20, 
p. 210). 

Borel and Serre [5, §11.1] proved [ is of type F;, (see Remark 4.7.12). 
(We remark that there is no harm in assuming [ is torsion free, since being of 
type F,, is invariant under passage to finite-index subgroups [11, Cor. 7.2.4, 
p. 170].) 

Theorem 4.8.2 is proved in [16, Thm. 6.11, p. 93] and [4, Cor. 17.7, p. 119], 
in stronger forms that establish Remark 4.8.5(2,3). Our alternate proof of 
Theorem 4.8.2 is excerpted from the elementary proof in [1]. 

Warning 4.8.3 is due to P. Deligne [8]. See also [17]. 

For an introduction to the Congruence Subgroup Property, see [12, 
Chap. 6] or [18]. 

The Tits Alternative (4.9.1) was proved by Tits [19]. A nice introduction 
(and a proof of some special cases) can be found in [7]. 

See Section 14.2 for more on the Moore Ergodicity Theorem (4.10.2) and 
related results. 
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Chapter 5 


What ts an 
Arithmetic Group? 


SL(n, Z) is the most basic example of an “arithmetic group’ We will see 
that, by definition, the other arithmetic groups are obtained by intersecting 
SL(n, Z) with some semisimple subgroup G of SL(n,R). More precisely, if 
G is a subgroup of SL(n,R) that satisfies certain technical conditions (to be 
explained in Section 5.1), then GN SL(n,Z) (the group of “integer points” 
of G) is said to be an arithmetic subgroup of G. However, the official 
definition (5.1.19) also allows certain modifications of this subgroup to be 
called arithmetic. 

Different embeddings of G into SL(n,R) can yield different intersections 
with SL(n, Z), so G has many different arithmetic subgroups. (Examples can 
be found in Chapter 6.) Theorem 5.1.11 tells us that all of them are lattices 
in G. In particular, SL(n, Z) is a lattice in SL(n, R). 


§5.1. Definition of arithmetic subgroups 


We are assuming that G is a subgroup of SL(@, R) (see the Standing Assump- 
tions (4.0.0)), and we are interested in T = GM SL(Z,Z), the set of “integer 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(¢, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Ma itions 4.1.9 and 4.2.1 (lattice 


subgroups and commensurability ). 
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points” of G. However, in order for the integer points to form a lattice, 
G needs to be well-placed with respect to SL(¢,Z). (If we replace G by a 
conjugate under some terrible irrational matrix, perhaps GM SL(¢, Z) would 
become trivial (see Exercise 1).) The following proposition is an elementary 
illustration of this idea. 


(5.1.1) Proposition. The following are equivalent, for every subspace W 
of R®: 

1) WAZ! is a cocompact lattice in W. 

2) W is spanned by WO Z*. 

3) WNQ* is dense in W. 

4) W can be defined by a set of linear equations with coefficients in Q. 


Proof. Let k = dimW. 

(1 > 2) Let V be the R-span of WN Z‘. Then W/V, being a vector space 
over R, is homeomorphic to R@, for some d. On the other hand, we know that 
WoZ! CV, and that W/(W 9 Z*) is compact, so W/V is compact. Hence 
d=0,soV =W. 

(2 > 1) Let {e1,...,e,} be the standard basis of R*’. Because Wn Zé 
contains a basis of W, there is a linear isomorphism T': R* + W, such that 
T({e1,...,€n}) CWNZ*. This implies that T(Z*) CWO Z*. Since R*/Z* 
is compact, and T is continuous, we conclude that W/(W M Z*) is compact. 

(2 = 3) As in the proof of (2 > 1), there is a linear isomorphism T: R* > 
W, such that T(Z*) CWOZ*. Then T(Q*) CWNQ*. Since Q* is dense 
in R*, and T is continuous, we conclude that T(Q*) is dense in W. 

(4 = 2) By assumption, W is the solution space of a system of linear 
equations whose coefficients belong to Q. (Since R° is finite dimensional, only 
finitely many of the equations are necessary.) Therefore, by elementary linear 
algebra (row reductions), we may find a basis for W that consists entirely 
of vectors in Q*. Multiplying by a scalar to clear the denominators, we may 
assume that the basis consists entirely of vectors in Z*. 

(3 > 4) Since WN Q° is dense in W, we know that the orthogonal com- 
plement W~+ is defined by a set of linear equations with rational coefficients. 
(For each w € WN Q*‘, we write the equation w-xz = 0.) Thus, from (4 => 2), 


we conclude that there is a basis v1,...,Um of We; such that each vj € Q’. 
Then W = (W+)+ is defined by the system of equations vj - 2 = 0, ..., 
Um: x=O0. im 


With the above proposition in mind, we make the following definition. 


(5.1.2) Definition (cf. Definitions A4.1 and A4.7). Let H be a closed sub- 
group of SL(é,R). We say H is defined over Q (or that H is a Q-subgroup) 
if there is a subset Q of Q/x11,...,2¢¢], such that 


e Var(Q) = {g € SL(Z,R) | Q(g) = 0, VQ € QO} isasubgroup of SL(¢, R), 
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e H° = Var(Q)°, and 
e H has only finitely many components. 
In other words, H is commensurable to the variety Var(Q), for some set Q of 
Q-polynomials. 
(5.1.3) Examples. 
1) SL(£,R) is defined over Q: let O = 0. 
2) If mn < €, we may embed SL(n,R) in the top left corner of SL(@,R). 
This copy of SL(n,R) is defined over Q: let 
Q={2,; — 6) | max{i,j} > n}. 
3) For A € SL(¢, Q), the group SO;(A;R) = {g € SL(¢,R) | gAg? = A} is 
defined over Q: let 


oe y ipApg®jq— Aig | 1Sij Gmtn 
1l<p,qgm+n 
In particular, SO(m,n), under its usual embedding in SL(m +n, R), is 
defined over Q. 
4) SL(n, C), under its usual embedding in SL(2n, R), is defined over Q (cf. 
Example A4.2(4)). 


(5.1.4) Remarks. 

1) There is always a subset Q of R[x11,...,2%e,], such that G is commen- 
surable to Var(Q) (see Theorem A4.9); that is, G is defined over R. 
However, it may not be possible to find a set Q that consists entirely 
of polynomials whose coefficients are rational, so G may not be defined 


over Q. 
2) If G is defined over Q, then the set Q of Definition 5.1.2 can be chosen 
to be finite (because the ring Q[r1,1,...,2¢,] is Noetherian). 


(5.1.5) Proposition. G is isogenous to a group that is defined over Q. 


Proof. It is easy to handle direct products, so the crucial case is when G' is 
simple. This is easy if G is classical. Indeed, the groups in Examples A2.3 
and A2.4 are defined over Q (after identifying SL(@,C) and SL(¢,H) with 
appropriate subgroups of SL(2é, IR) and SL(42@, R), in a natural way). 

The general case is not difficult for someone familiar with exceptional 
groups. Namely, since AdG is a finite-index subgroup of Aut(g), it suffices 
to find a basis of g, for which the structure constants of the Lie algebra are 
rational. We omit the details. 


(5.1.6) Notation. For each subring O of R (containing 1), we construct 
Go = GNSL(n,O). That is, Go is the subgroup consisting of the elements 
of G whose matrix entries all belong to O. 
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(5.1.7) Example. Let ¢: SL(n,C) > SL(2n,R) be the natural embedding. 
Then 


(SL(n,C)) 9 = o(SL(n, Qii))). 


Therefore, if we think of SL(n,C) as a Lie group over R, then SL(n, Qji]) 
represents the “Q-points” of SL(n, C). 


The following result provides an alternate point of view on being defined 
over Q. It is the nonabelian version of (3 = 4) of Proposition 5.1.1. 


(5.1.8) Proposition. Let H be a connected subgroup of SL(é,R) that is almost 
Zariski closed. The group H is defined over Q if and only if Hg ts dense in H. 


Proof. (<) Let Qc = {Q € C[aii,..., ree] | Q(h) = 0, Vh € H}. Also, 
for d € N, let O4 = {Q € Qc | degQ < d}. Since Hg is dense in H (and 
polynomials are continuous), it is clear that OF is invariant under the Galois 
group Gal(C/Q), so it is not difficult to see that Q¢ is spanned (as a vector 
space over C) by a collection Q% of polynomials with rational coefficients 
(see Exercise 2). Since H is almost Zariski closed, and polynomial rings are 
Noetherian, we have H° = Var(Q¢/ for d sufficiently large. The polynomials 
in Q? all have rational coefficients, so this implies that H is defined over Q. 

(=) See Exercise 5.3#8 for a proof when G is simple and G/Gz is not 
compact. The general case utilizes a fact from the theory of algebraic groups 
that will not be proved in this book (see Exercise 3). O 


(5.1.9) Warning. Proposition 5.1.8 requires the assumption that H is con- 
nected; there are subgroups H of SL(¢,R), such that H is defined over Q, but 
Hg is not dense in H. For example, let 


H = {h € SO(2) | h® =Id}. 


(5.1.10) Remark. The Jacobson-Morosov Lemma (A5.8) has a relative ver- 
sion: if G is defined over Q, and uw is a nontrivial, unipotent element of Go, 
then there is a (polynomial) homomorphism ¢: SL(2,R) — G, such that 


$([o t]) =u and g(SL(2, Q)) € Ge. 


We now state a theorem of fundamental importance in the theory of 
lattices and arithmetic groups. It is a nonabelian analogue of the obvious fact 
that Z° is a lattice in R*, and of (4 > 1) of Proposition 5.1.1. 


(5.1.11) Major Theorem. /[f G is defined over Q, then Gz is a lattice in G. 


Proof. The statement of this theorem is more important than its proof, so, for 
most purposes, the reader could accept this fact as an axiom, without learning 
the proof. For those who do not want to take this on faith, a discussion of 
two different proofs can be found in Chapter 7 (with some additional details 
in Chapter 19). O 
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(5.1.12) Example. Here are some standard cases of Theorem 5.1.11. 
1) SL(2, Z) is a lattice in SL(2, R). (We proved this in Example 1.3.7.) 
2) SL(n, Z) is a lattice in SL(n,R). (We will prove this in Chapter 7.) 
3) SO(m,n)z is a lattice in SO(m,n). 
4) SL(n, Z[2]) is a lattice in SL(n, C) (cf. Example 5.1.7). 


(5.1.13) Example. As an additional example, let 
G = SO(7a7 — 3 — #3;R) & SO(1, 2). 
Then Theorem 5.1.11 implies that Gz is a lattice in G. This illustrates that 


the theorem is a highly nontrivial result. For example, in this case, it may 
not even be obvious to the reader that Gz is infinite. 


(5.1.14) Warning. Theorem 5.1.11 requires our standing assumption that 
G is semisimple; there are subgroups H of SL(é,R), such that H is defined 
over Q, but Hz is not a lattice in H. For example, if H is the group of 
diagonal matrices in SL(2,R), then Hz is finite, not a lattice in H. 


(5.1.15) Remark. The converse of Theorem 5.1.11 holds when G has no com- 
pact factors (see Exercise 5). 


Combining Proposition 5.1.5 with Theorem 5.1.11 yields the following 
important conclusion: 


(5.1.16) Corollary. G has a lattice. 


In fact, a more careful look at the proof shows that if G is not compact, 
then the lattice we constructed is not cocompact: 


(5.1.17) Corollary. If G is not compact, then G has a noncocompact lattice. 


Proof. Assume that G is classical, which means it is one of the groups listed in 
Examples A2.3 and A2.4. As was mentioned in the proof of Proposition 5.1.5, 
each of these groups has an obvious Q-form Gg, obtained by replacing R 
with Q (or replacing C with QJj?]), in a natural way. Whenever G is non- 
compact, it is not difficult to see that Gg has a nontrivial unipotent element 
(see Exercise 8), so Corollary 4.4.4 tells us that G/G'z is not compact. 


(5.1.18) Remark. We will show in Theorem 18.7.1 that G also has a cocompact 
lattice, and a special case that illustrates the main idea of the proof will be 
seen much earlier, in Example 5.5.4. 


A lattice of the form Gz is said to be arithmetic. However, for the fol- 
lowing reasons, a somewhat more general class of lattices is also said to be 
arithmetic. The idea is that there are some obvious modifications of Gz that 
are also lattices, and any subgroup that is obviously a lattice should be called 
arithmetic. 
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e If 6: G; > Gz is an isomorphism, and T, is an arithmetic subgroup 
of G,, then we wish to be able to say that ¢([1) is an arithmetic 
subgroup in Go. 

e We wish to ignore compact groups; that is, modding out a compact 
subgroup should not affect arithmeticity. So we wish to be able to say 
that if K is a compact normal subgroup of G, and TI is a lattice in G, 
then [ is arithmetic if and only if [K/K is an arithmetic subgroup 
of G/K 

e Arithmeticity should be independent of commensurability. 


The following formal definition implements these considerations. 


(5.1.19) Definition. [ is an arithmetic subgroup of G if and only if there 
exist 


e a closed, connected, semisimple subgroup G’ of some SL(n,R), such 
that G’ is defined over Q, 


e compact normal subgroups K and K’ of G° and G’, respectively, and 
e an isomorphism ¢: G°/K > G'/K’, 


such that ¢([) is commensurable to Gi, where I’ and G, are the images of 
I'M G° and Gy in G°/K and G’/K", respectively. 


(5.1.20) Remarks. 


1) If G has no compact factors, then it is obvious that the subgroup K in 
Definition 5.1.19 must be finite. 


2) Corollary 5.3.2 will show that if G/T is not compact (and [I is irre- 
ducible), then the annoying compact subgroups are not needed in Def- 
inition 5.1.19. 


3) On the other hand, if [ is cocompact, then a nontrivial (connected) 
compact group K’ may be required (even if G has no compact fac- 
tors). We will see many examples of this phenomenon, starting with 
Example 5.5.4. 


4) Up to conjugacy, there are only countably many arithmetic lattices in G, 
because there are only countably many finite subsets of the polynomial 
ring Q[vij,...,2e,0]. 


(5.1.21) Other terminology. Our definition of arithmetic subgroup as- 
sumes the perspective of Lie theory, where I’ is assumed to be embedded in 
some Lie group G. The theory of algebraic groups has a more strict defini- 
tion, which requires [T to be commensurable to Gz: arbitrary isomorphisms 
are not allowed, and compact subgroups cannot be ignored. At the other ex- 
treme, abstract group theory has a much looser definition, which completely 
ignores G: if an abstract group A is abstractly commensurable to a group 
that is arithmetic in our sense, then A is considered to be arithmetic. 
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Exercises for §5.1. 


#1. 


#2. 


#3. 


4A, 


#5. 


#6. 


HT. 


Show that if G is connected, G C SL(¢,R), and —Id ¢ G, then there 
exists h € SL(€,IR), such that (h~+Gh) NSL(@, Z) is trivial. 

[Hint: For each nontrivial y € SL(é,Z), let X 7 = {h € SL(¢,R) | hyh7' € 
G}. Then each Xy is nowhere dense in SL(@,R) (see Exercise A4#4(b)).] 
Let W be a vector subspace of C”, for some n. Show that W is invariant 
under Gal(C/Q) if and only if W is spanned by a set of vectors with 
rational coordinates. 


[Hint: (=) Choose w € W ~ {0} with a minimal number of nonzero coordi- 
nates, and multiply by a scalar to assume at least one coordinate is a nonzero 
rational. Since o(w) — w € W for all o € Gal(C/Q), the minimality implies 
w € Q”. Mod out w and induct on the dimension] 


It can be shown that G° is unirational. This means there exists an 
open subset U of some R”, and a function f: U > G°, such that 
e f(U) contains an open subset of G, and 
e each matrix entry of f(x) is a rational function of x (that is, a 
quotient of two polynomials). 
Furthermore, if G is defined over Q, then f can be chosen to be defined 
over Q (that is, all of the coefficients of f are in Q). 
Assuming the above, show that Gg is dense in G if G is connected 
and G is defined over Q. 


[Hint: Unirationality implies that Gg contains an open subset of G.] 


For H as in Warning 5.1.9, show that Hg is not dense in H. 
[Hint: H is finite, and Ho # H.| 


Show that if G C SL(é@,R), G has no compact factors, and Gz is a 
lattice in G, then G is defined over Q. 
[Hint: See the proof of Proposition 5.1.8(<). Since Gz is a lattice in G, the 
Borel Density Theorem (4.5.6) implies that Qé is invariant under the Galois 
group. | 
Show that if 

e GCSL(¢,R), and 

e Gz is Zariski dense in G, 
then Gz is a lattice in G. 
[Hint: It suffices to show that G is defined over Q.] 


Show that if 

e G has no compact factors, 

e I’, and ly are arithmetic subgroups of G, and 

e I’; NT. is Zariski dense in G, 
then I’; is commensurable to Ts. 
[Hint: Suppose ¢;: G — Hj is an isomorphism, such that ¢,;([;) = (H;)z. 
Define ¢: G + Hi x Hz by (9) = (¢1(9), G2(g)). Then 6(G@)z = 6(T1 NT 2) 
is Zariski dense in #(G), so TP; NT 2 is a lattice in G (see Exercise 6). A similar 
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(but slightly more complicated) argument applies if ¢;: G > H;/K,, where 
K; is compact.| 

##8. For each classical simple group G in Examples A2.3 and A2.4, let Go 
be the subgroup obtained by replacing R with Q, C with Qji], or H 
with Hg = Q+@+Qj + Qk, as appropriate. Show that if G is not 
compact, then Gg contains a nontrivial unipotent element. 


[Hint: Show that Gg contains a copy of either SL(2,Q), SO(1,2)o, or 
SU(1, L)o (cf. Remark A2.6).] 


§5.2. Margulis Arithmeticity Theorem 


The following astonishing theorem shows that taking integer points is usually 
the only way to make a lattice. (See Section 16.3 for a sketch of the proof.) 
(5.2.1) Theorem (Margulis Arithmeticity Theorem). Jf 
e G is not isogenous to SO(1,n) x K or SU(1,n) x K, for any compact 
group K, and 
e I is irreducible, 


then T is arithmetic. 


(5.2.2) Warning. Unfortunately, 

e SL(2,R) is isogenous to SO(1, 2), and 

e SL(2,C) is isogenous to SO(1, 3), 
so the arithmeticity theorem says nothing about the lattices in these two 
important groups. 


(5.2.3) Remark. The conclusion of Theorem 5.2.1 can be strengthened: the 
subgroup K of Definition 5.1.19 can be taken to be finite. More precisely, if 
G and [ are as in Theorem 5.2.1, and G is noncompact and has trivial center, 
then there exist 


e a closed, connected, semisimple subgroup G’ of some SL(@,R), such 
that G’ is defined over Q, and 


e a surjective (continuous) homomorphism ¢: G’ > G, 
such that 


1) (GZ) is commensurable to ['; and 


2) the kernel of ¢ is compact. 
(5.2.4) Remarks. 


1) For any G, it is possible to give a reasonably complete description of 
the arithmetic subgroups of G (up to conjugacy and commensurabil- 
ity). Some examples are worked out in fair detail in Chapter 6. More 
generally, Theorem 18.5.3 (or the table on page 380) essentially pro- 
vides a list of all the irreducible arithmetic subgroups of almost all of 
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the classical groups. Thus, for most groups, the Margulis Arithmeticity 
Theorem provides a list of all the lattices in G. 


Furthermore, knowing that IT is arithmetic provides a foothold to use 
algebraic and number-theoretic techniques to explore the detailed struc- 
ture of I’. For example, we saw that it is easy to show I is torsion free 
if [ is arithmetic (see Theorem 4.8.2). A more important example is 
that (apparently) the only known proof that every lattice is finitely 
presented (see Theorem 4.7.10) relies on the Margulis Arithmeticity 
Theorem. 


It is known that there are nonarithmetic lattices in SO(1,7) for ev- 
ery n (see Corollary 6.5.16), but we do not yet have a theory that 
describes them all when n > 3. Also, nonarithmetic lattices have been 
constructed in SU(1,n) for n € {1,2,3}, but (apparently) it is still not 
known whether they exist when n > 4. 


(5.2.5) Remark. The subgroup 


Comma (I) = {g € G| gg" is commensurable to I} 


is called the commensurator of I in G. It is easy to see that if G is defined 
over Q, then Gg C Comme (Gz) (cf. Exercise 4.8411). 


1) 


This implies that if [ is arithmetic (and G is connected, with no com- 
pact factors), then Commg(Gz) is dense in G (see Proposition 5.1.8). 
Margulis proved a converse. Namely, if G is connected and has no com- 
pact factors, then IT is arithmetic iff Commg(I) is dense in G 
(see Theorem 16.3.3). This is known as the Commensurability Criterion 
for Arithmeticity. 


In some cases, the commensurator of G'z is much larger than G'g (see Ex- 
ercise 1). However, it was observed by Borel that this never happens 
when the “complexification” of G° has trivial center (and other mi- 
nor conditions are satisfied) (see Exercise 4). (See Section 18.1 for an 
explanation of the complexification.) 


Exercises for §5.2. 


#1. 


#2. 
#3. 


Let G = SL(2,R) and Gz = SL(2,Z). Show Commg(Gz) is not com- 
mensurable to Gg. 


[Hint: The diagonal matrix diag (/D, 1 /\/P) commensurates Gz, for all p € 
Z| 


Show Commsr3,r) (SL(3, Z)) is not commensurable to SL(3, Q). 


Show that if G is simple and [I is not arithmetic, then T, NG(T), and 
Commg(I) are commensurable to each other. 
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#4. (requires some knowledge of algebraic groups) Assume G is connected 
and Gz is Zariski dense in G (cf. Corollary 4.5.6). The complexification 
G ® C is defined in Notation 18.1.3. 
Show that if Z(G @ C) = {e}, then Commg(Gz) = Go. 


(Hint: For g € Comme¢(Gz), we know that Adg is an automorphism of 
the Lie algebra g that is defined over Q, so Adg € (AdG)g. However, the 
assumptions imply that the adjoint representation is an isomorphism (and it 
is defined over Q).] 


#5. Show that the assumption Z(G ®C) = {e} cannot be replaced with the 
weaker assumption Z(G) = {e} in Exercise 4. 


[Hint: Any matrix in GL(3, Q) has a scalar multiple that is in SL(3,R), but 
SL(3, Q) has infinite index in GL(3, Q).| 


§5.3. Unipotent elements of noncocompact lattices 


The following result answers one of the most basic topological questions about 
the manifold G/Gz: is it compact? 


(5.3.1) Proposition (Godement Compactness Criterion). Assume that G is 
defined over Q. The homogeneous space G/Gz is compact if and only if 
Gz has no nontrivial unipotent elements. 


Proof. (=) This is the easy direction (see Corollary 4.4.4). 

(<) We prove the contrapositive: suppose G/Gz is not compact. (We 
wish to show that Gz has a nontrivial unipotent element.) From Proposi- 
tion 4.4.6 (and the fact that Gz is a lattice in G (see Theorem 5.1.11)), we 
know that there exist nontrivial y € Gz and g € G, such that 9y = Id. Be- 
cause the characteristic polynomial of a matrix is a continuous function of the 
matrix entries of the matrix, we conclude that the characteristic polynomial 
of 9y is approximately (a — 1)* (the characteristic polynomial of Id). On the 
other hand, similar matrices have the same characteristic polynomial, so this 
means that the characteristic polynomial of 7 is approximately (x —1)*. Now 
all the coefficients of the characteristic polynomial of y are integers (because 7 
is an integer matrix), so the only way this polynomial can be close to (x —1)* 
is by being exactly equal to (x —1)*. Therefore, the characteristic polynomial 
of y is (x — 1)’, so 7 is unipotent. O 


The following important consequence of the Godement Criterion tells us 
that there is often no need for compact subgroups in Definition 5.1.19, the 
definition of an arithmetic group: 

(5.3.2) Corollary. Assume 
e IT is an irreducible, arithmetic subgroup of G, 
e G/T is not compact, and 


e G is connected and has no compact factors. 
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Then, perhaps after replacing G by an isogenous group, there is an embedding 
of G in some SL(€,R), such that 


1) G is defined over Q, and 


2) T is commensurable to Gz. 


Proof. From Definition 5.1.19 (and Remark 5.1.20(1)) we know that (up to 
isogeny and commensurability) there is a compact group K’, such that we 
may embed G’ = G x K’ in some SL(¢,R), such that G’ is defined over Q, 
and [K' = G7k’. 

Let N be the almost-Zariski closure of the subgroup of G’ generated by 
all of the unipotent elements of GZ. Since G/T is not compact, the propo- 
sition implies N is infinite. However, K’ has no unipotent elements (see 
Remark A5.2(2)), so N CG. Also, the definition of N implies that it is nor- 
malized by the Zariski closure of GZ. Therefore, the Borel Density Theorem 
(4.5.7) implies that N is a normal subgroup of G. 

Assume, for simplicity, that G is simple (see Exercise 1). Then the con- 
clusion of the preceding paragraph tells us that N = G. Therefore, G is 
the almost-Zariski closure of a subset of G,, which implies that G is defined 
over Q (cf. Exercise 5.1445). Hence, Gz is a lattice in G, and it is easy to see 
that it is commensurable to TI (see Exercise 2). 


In the special case where I is arithmetic, the following result is an easy 
consequence of Proposition 5.3.1, but we will not prove the general case (which 
is more difficult). The assumption that G has no compact factors cannot be 
eliminated (see Exercise 3). 


(5.3.3) Theorem. Assume G has no compact factors. The homogeneous space 
G/T is compact if and only if T has no nontrivial unipotent elements. 


The above proof of Proposition 5.3.1 relies on the fact that Gz is a lattice 
in G, which will not be proved until Chapter 7. The following result illustrates 
that the cocompactness of Gz can sometimes be proved quite easily from the 
Mahler Compactness Criterion (4.4.7), without assuming that it is a lattice. 
(5.3.4) Proposition. If 

e B(x,y) is a symmetric, bilinear form on Q’, such that 

e B(x,x) £0 for all nonzero x € Q, 
then SO(B)z is cocompact in SO(B)p. 

Proof. Let G = SO(B) and [ = SO(B)z = Gz. (Our proof will not use the 
fact that [ is a lattice in G.) Replacing B by an integer multiple to clear the 
denominators, we may assume B(Z‘, Z*) C Z. 

Step 1. The image of G in SL(£,R)/SL(Z, Z) is precompact. Let 


e {gn} be a sequence of elements of G and 
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e {vn} be a sequence of elements of Z \ {0}. 


Suppose that gnu, — 0. (This will lead to a contradiction, so the desired 
conclusion follows from the Mahler Compactness Criterion (4.4.7).) 

Since B(v,v) 4 0 for all nonzero v € Z*, and B(Z‘,Z*) C Z, we have 
|B(vUn, Un)| = 1 for all n. Therefore 


1 < |B(un, vn)| = |B(gnUn; GnUn)| 2 |B(0,0)| = 0. 
This is a contradiction. 
Step 2. The image of G in SL(@,R)/SL(@, Z) is closed. Suppose 
Gn¥n 2h ESL(,R), with g, € G and 7, € SL(Z, Z). 


We wish to show h € G SL(£, Z). 
Let {€1,--- ,€} be the standard basis of R* (so each ¢; € Z*). Then 


B(Yn€j, YnEx) € B(Z’,Z*) CZ. 


We also have 
BYnt es Wnt) = B Gans GnVnter) — -Blhes; hex). 
Since Z is discrete, we conclude that B(yn€;,Ynéx) = Blhe;, hex) for any 


sufficiently large n. Therefore hy,' € SO(B) (see Exercise 9), so we have 
h€ Gyn C G SL(Z, Z). 


Step 3. Completion of the proof. Define ¢: G/T — SL(¢,R)/SL(Z,Z) by 
o(gl) = gSL(é,Z). By combining Steps 1 and 2, we see that the image of ¢ 
is compact. Therefore, it suffices to show that ¢ is a homeomorphism onto 
its image. 

Given a sequence {g,} in G, such that {¢(g,I)} converges, we wish to 
show that {g,I'} converges. There is a sequence {7,} in SL(@,Z), and some 
h € G, such that gnyn — h. The proof of Step 2 shows, for all large n, that 
h € Gyn. Then y, € Gh = G (and we know 7, € SL(¢,Z)), so Yn € Gz =T. 
Therefore, {g,1} converges (to hI’), as desired. O 


Exercises for §5.3. 


#1. The proof of Corollary 5.3.2 assumes that G is simple. Eliminate this 
hypothesis. 
[Hint: The proof shows that N MT is a lattice in N. Since [ is irreducible, 
this implies N = G.| 

#2. At the end of the proof of Corollary 5.3.2, show that Gz is commensu- 
rable to LI. 


[Hint: We know Gi,K'’ = TK’, and Gz has finite index in G%, (see Exer- 
cise 4.17410). Mod out K’.] 


#3. Show there is a noncocompact lattice T in SL(2,R) x SO(3), such that 


no nontrivial element of I’ is unipotent. 


[Hint: SL(2,R) has a lattice I’ that is free. Let [ be the graph of a homo- 
morphism from I” to SO(3).] 
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#4. Suppose G C SL(¢,R) is defined over Q. 

a) Show that if N is a closed, normal subgroup of G, and N is defined 
over Q, then GzN is closed in G. 

b) Show that Gz is irreducible if and only if no proper, closed, con- 
nected, normal subgroup of G is defined over Q. (That is, if and 
only if G is Q-simple.) 

c) Let H be the Zariski closure of the subgroup generated by the 
unipotent elements of Gz. Show that H is defined over Q. 


#5. Show that if every element of [ is semisimple, then G/T is compact. 


[Hint: There is no harm in assuming that G has no compact factors (why’), 
so Theorem 5.3.3 applies. ] 


#6. (assumes some familiarity with reductive groups) Prove the converse of 
Exercise 5. 
[Hint: Let kau be the real Jordan decomposition of an element g of I’. Since 
Ca(ka) is reductive (see Exercise 8.2#2), the Jacobson-Morosov Lemma pro- 
vides a subgroup L of Cg(ka) that contains u and is isogenous to SL(2, R). 
So ka is in the closure of “g. However, ©g is closed, since TI’ is discrete and 
cocompact. Therefore g = ka is semisimple.| 


#7. Assuming [ = Gz is arithmetic (and G is defined over Q), prove the 
following are equivalent: 
a) G/Gz is compact. 
) Gg has no nontrivial unipotent elements. 
) Every element of Gg is semisimple. 
d) Every element of I is semisimple. 

) Gg does not contain a subgroup isogenous to SL(2, Q). (More pre- 
cisely, there does not exist a continuous homomorphism p: SL(2,R) > 
G, such that p(SL(2,Q)) C Go.) 

[Hint: . = c) Jordan decomposition. (e = b) Jacobson-Morosov Lemma 
(5.1.10). 


#8. Show that Gg is dense in G if G is defined over Q, G is simple, and 
G/Gz is not compact. 
[Hint: The Godement Criterion implies that Gg has a nontrivial unipotent 
element u. Write u = expT = ~ T* /k! (where T € Matexe(Q) and 
T’*! = 0). Then exp(rT) € Go for all r € Q, so the identity component of 


Go is nontrivial. Combining Theorem 5.1.11 with the Borel Density Theorem 
(4.5.2) implies that Go = G\] 


#9. Let B(x, y) be a symmetric, bilinear form on R*, let {v1,--- ,ue} bea 
basis of R‘, and let y,h € SL(¢,R). If B(yv;, yur) = B(hvj, hvg) for all 
j and k, show that hy! € SO(B). 


[Hint: {yv1,..., uc} is a basis of R‘.] 
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§5.4. How to make an arithmetic subgroup 


The definition that (modulo commensurability, isogenies, and compact fac- 
tors) an arithmetic subgroup must be the Z-points of G has the virtue of 
being concrete. However, this concreteness imposes a certain lack of flexi- 
bility. (Essentially, we have limited ourselves to the standard basis of the 
vector space R”, ignoring the possibility that some other basis might be more 
convenient in some situations.) We now describe a more abstract viewpoint 
that makes the construction of general arithmetic lattices more transparent. 
(In particular, this approach will be used in 85.5.) The key point is that 
there are analogues of Z’ and Q° in any real vector space, not just R‘ (see 
Lemma 5.4.3(1)). 


(5.4.1) Definitions. Let V be a real vector space. 


1) 


A Q@subspace Vo of V is a Q-form of V if the natural R-linear map 
Vg ®g R > V is an isomorphism (see Exercise 1). (The map is defined 
by v@trH tv.) 


A polynomial f on V is defined over Q (with respect to the Q-form 
Vo) if f(Vo) C Q (see Exercise 2). 


A subgroup £ of the additive group of Vg is a Z-lattice in Vg if it is 
finitely generated and the natural Q-linear map £L ®z Q — Vo is an 
isomorphism (see Exercise 3). (The map is defined by v ®t + tw.) 


Each Q-form Vg of V yields a corresponding Q-form of the real vector 
space End(V) by End(V)g = { A € End(V) | A(Vg) C Vo } (see Exer- 
cise 5). 

A function Q on a real vector space W is a polynomial if for some 
(hence, every) R-linear isomorphism ¢: R’ & W, the composition f o¢ 
is a polynomial function on R°. 


A subgroup H of SL(V) is defined over Q (with respect to the Q-form 
Vg) if there exists a set Q of polynomials on End(V), such that 
e every Q € Q is defined over Q (with respect to the Q- form Vg), 
e Var(Q) = {g € SL(V) | Q(g) = 0 for all Q € Q} is a subgroup of 
SL(V), and 
e Var(Q)° is a finite-index subgroup of H. 


(5.4.2) Remarks. 


1) 


2) 


Suppose G C SL(¢,R), as usual. For the standard Q-form Q° of R*, it 
is easy to see that G is defined over Q in terms of Definition 5.4.1 if 
and only if it is defined over Q in terms of Definition 5.1.2. 


Some authors simply call £ a lattice in Vo, but this could cause con- 
fusion, because L is not a lattice in Vo, in the sense of Definition 4.1.9 
(although it is a lattice in V). 
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A Qform Vg and Z-lattice £ simply represent Q* and Z‘, under some 
identification of V with R*: 


(5.4.3) Lemma. Let V be an ¢-dimensional real vector space. 


1) If Vo is a Q-form of V, then there exists an R-linear isomorphism 
6: V > R*, such that d(Vo) = Q’. Furthermore, if L is any Z-lattice 
in Vg, then @ may be chosen so that ¢(L) = Z*. 


2) A polynomial f on R° is defined over Q (with respect to the standard 
Q-form Q*) if and only if every coefficient of f is in Q (see Exercise 2). 


Also note that any two Z-lattices in Vg are commensurable: 


(5.4.4) Lemma (see Exercise 6). [f £1 and Ly are two Z-lattices in Vg, then 
there is some nonzero p € Z, such that p£Ly C Lo and p£Ly C Ly. 


It is now easy to prove the following more abstract characterization of 
arithmetic subgroups (see Exercises 7 and 8). 


(5.4.5) Proposition. Suppose G C GL(V), and G is defined over Q, with 
respect to the Q-form Vo. 
1) If £L is any Z-lattice in Vo, then 
Goe={gEeG|gL=L} 
is an arithmetic subgroup of G. 
2) If Ly and Le are Z-lattices in Vo, then Ge, is commensurable to Ge, . 
From Proposition 5.4.5(2), we see that the arithmetic subgroup Gf is 


almost entirely determined by the Q-form Vg; choosing a different Z-lattice 
in Vg will yield a commensurable arithmetic subgroup. 


Exercises for §5.4. 


##1. Show that a Q-subspace Vg of V is a Q-form if an only if there is a 
subset B of Vo, such that B is both a Q-basis of Vg and an R-basis 
of V. 


#2. For the standard Q-form Q* of R‘, show that a polynomial is defined 
over Q if and only if all of its coefficients are rational. 


##3. Show that a subgroup C£ of Vo is a Z-lattice in Vg if and only if there is 
a Q-basis B of Vo, such that £ is the additive abelian subgroup of Vo 
generated by B. 


#4. Let V be a real vector space of dimension ¢, and let £ be a discrete 
subgroup of the additive group of V. Recall that the rank of an abelian 
group is the largest r, such that the group contains a copy of Z”. 

a) Show that L is a finitely generated, abelian group of rank < @, 
with equality if and only if the R-span of L is V. 
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b) Show that if the rank of £ is @, then the Q-span of L is a Q-form 
of V, and £ is a Z-lattice in Vo. 


[Hint: Induction on @. For X € L, show that the image of £ in V/RA is 
discrete. | 


#5. Verify: if Vo is a Q-form of V, then End(V)g is a Q-form of End(V). 
#6. Prove Lemma 5.4.4. Conclude that A; and Ag are commensurable. 
#7. Prove Proposition 5.4.5(1). [Hint: Use Lemma 5.4.3.] 

#8. Prove Proposition 5.4.5(2). [Hint: Use Lemma 5.4.4.] 


§5.5. Restriction of scalars 


We know that SL(2, Z) is an arithmetic subgroup of SL(2,R). In this section, 
we explain that SL(2, Z[/2]) is an arithmetic subgroup of the group SL(2, R) x 
SL(2, R) (see Example 5.5.3). More generally, recall that any finite extension 
of Q is called an algebraic number field. We will see that if O is the ring 
of algebraic integers in any algebraic number field F’, and G is defined over F, 
then Go is an arithmetic subgroup of a certain group G’ that is related to G. 


(5.5.1) Remark. In practice, we do not require O to be the entire ring of 
algebraic integers in F’: it suffices for the ring O to have finite index in the 
ring of integers (as an additive group); equivalently, the Qspan of O should 
be all of F’, or, in other words, the ring O should be a Zlattice in F. (A 
Z-lattice in F’ that is also a subring is called an order in F’..) 


Any complex vector space can be thought of as a real vector space (of 
twice the dimension). Similarly, any complex Lie group can be thought of as a 
real group (of twice the dimension). Restriction of scalars is the generalization 
of this idea to any field extension F'/L, not just C/R. This yields a general 
method to construct arithmetic subgroups. 


(5.5.2) Example. Let 


e f= Qlv2], 

e O =ZI\v2I, and 

e o be the nontrivial Galois automorphism of F, 
and define a ring homomorphism A: F — R? by A(z) = (x, o(2)). 

It is easy to show that A(Q) is discrete in R?. Namely, for x € O, the 
product of the coordinates of A(x) is the product x - o(x) of all the Galois 
conjugates of x. This is the norm of the algebraic number x. Because x is an 
algebraic integer, its norm is an ordinary integer; hence, its norm is bounded 
away from 0. So it is impossible for both coordinates of A(x) to be small 
simultaneously. 
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More generally, if O is the ring of integers of any algebraic number field F’, 
this same argument shows that if we let {o1,...,0,} be the set of all embed- 
dings of O in C, and define A: O > C” by 

AG) oie) cesene)), 
then A(O) is a discrete subring of C”. 

Now A induces a homomorphism A,.: SL(4,O) — SL(é,C") (because 
SL(é,-) is a functor from the category of commutative rings to the cate- 
gory of groups). Furthermore, the group SL(¢,C") is naturally isomorphic 
to SL(¢,C)”. Therefore, we have a homomorphism (again called A) from 
SL(£,O) to SL(é,C)”. Namely, for y € SL(¢,O), we let o;(y) € SL(é,C) be 
obtained by applying o; to each entry of y, and then 

A(y) = (o1(y), eas) or(y))- 
Since A(Q) is discrete in C", it is obvious that the image of A, is discrete in 
SL(é,C"), so A(T) is a discrete subgroup of SL(é,C)", for any subgroup [ of 
SL(é, O). 

The main goal of this section is to show that if f = Go, and G is defined 
over F’, then the discrete group A(T) is an arithmetic subgroup of a certain 
subgroup of SL(¢,C)". 

To illustrate, let us show that SL 2 Z\V2}) is isomorphic to an arithmetic 
subgroup of SL(2,R) x SL(2,R). 

(5.5.3) Example. Let 

e T =SL(2, Z[V2)]), 

e G=SL(2,R) x SL(2,R), and 

e o be the conjugation on Qhv2] (so o(a+bV2) = a—by2, for a,b € Q), 
and define A: T > G by A(y) = (7,¢(7)). 

Then A(T) is an irreducible, arithmetic subgroup of G. 


Proof. Let F = QiV2] and O = Z)|/2]. Then F is a 2-dimensional vector 
space over Q, and O is a Z-lattice in F’. 

Since {(1, 1), (V2,-V/2)} is both a Q-basis of A(F) and an R-basis of R?, 
we see that A(F) is a Q-form of R?. Therefore, 

A(F’) = { (u,o(u)) € F* |we F? } 
is a Q-form of R*, and A(O?) is a Z-lattice in A(F?). 

Now G is defined over Q (see Exercise 2), so Ga(o2) is an arithmetic 
subgroup of G. It is not difficult to see that Ga(oz) = A(P) (see Exercise 3). 
Furthermore, because A([)  (SL(2, R) x e) is trivial, we see that the lattice 
A(T) must be irreducible in G (see Proposition 4.3.3). 


More generally, the proof of Example 5.5.3 shows that if G is defined 
over Q, then Gaya is isomorphic to an (irreducible) arithmetic subgroup of 
GxG. 
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Here is another sample application of the method. 


(5.5.4) Example. Let G = SO(a? + y? — V2z?;R) & SO(1,2). Then Gay/9) 
is a cocompact, arithmetic subgroup of G. 


Proof. As above, let o be the conjugation on Qh/2]. Let P = Gz/9° 
Let K’ = SO(a? + y? + V2z7) & SO(3), so o(f) C K’. (However, 
o([) ZG.) Then, we may 
define A:T +Gx K’ by A(y) = (7,0(9)). 


Arguing as in the proof of Example 5.5.3 establishes that A(T) is an arithmetic 
subgroup of G x Kk’. (See Exercise 4 for the technical point of verifying that 
G x K’ is defined over Q.) Since K’ is compact, we see, by modding out K’, 
that Tis an arithmetic subgroup of G. (This type of example is the reason 
for including the compact normal subgroup K’ in Definition 5.1.19.) 

Let y be any nontrivial element of [. Since o(y) € K’, and compact 
groups have no nontrivial unipotent elements (see Remark A5.2(2)), we know 
that o(7) is not unipotent. Therefore, o(7y) has some eigenvalue 4 1. Hence, 
7 has the eigenvalue 0~ (A) 4 1, so 7 is not unipotent. Therefore, Godement’s 
Criterion (5.3.1) implies that [is cocompact. Alternatively, this conclusion 
can easily be obtained directly from the Mahler Compactness Criterion (4.4.7) 
(see Exercise 6). O 


Let us consider one more example before stating the general result. 


(5.5.5) Example. Let 

2 F =), 

2 0 = 2, 

e T =SL(2,0), and 

e G=SL(2,R) x SL(2,R) x SL(2,C). 
Then I is isomorphic to an irreducible, arithmetic subgroup of G. 
Proof. For convenience, let a =/2. There are exactly 4 distinct embeddings 
00; 01, 02, 03 of F in C (corresponding to the 4 roots of z+ — 2 = 0); they are 
determined by: 

oo(a) =a (so 09 = Id), o1(a) = —a, o2(a) =ia, and o3(a) = —ia. 
Define A: F > R@ROC by A(z) = (2, 01(2), 02(z)). Then, arguing much 
as before, we see that A(F?) is a Q-form of R? @R? 6C?, G is defined over Q, 
and Gao2) = A(P). O 


These examples illustrate all the ingredients of the general result that will 
be stated in Proposition 5.5.8 after the necessary definitions. 


(5.5.6) Definition. Let F be an algebraic number field (or, in other words, 
let F be a finite extension of Q). 


1) 


2) 


3) 


4) 


via the natural embedding A: 7 ++ (o(7)) 
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Two distinct embeddings 01,02: F — C are said to be equivalent 
if o1(@) = oo(x), for all « € F (where Z denotes the usual complex 
conjugate of the complex number z). 


A place of F' is an equivalence class of embeddings in C. Therefore, 
each place consists of either one or two embeddings of F: 
e a real place consists of only one embedding (with o(F') C R), but 
e a complex place consists of two embeddings (with o(F’) Z R). 


We let S° = { places of F’}, or, abusing notation, we assume that S°° 
is a set of embeddings, consisting of exactly one embedding from each 
place. 


For 0 € S™, we let 


Fe- R if o is real, 
- Cif o is complex. 


Note that o(F’) is dense in F,, so F, is often called the completion 
of F' at the place o. 
For OC Fo[x11,.-.,Xe,¢], let 

Varr,(Q) = {9 € SL(4, Fe) | Q(g) = 0, VQ € Q}. 
Thus, for F, = R, we have Varg(Q) = Var(Q), and Varc(Q) is analo- 
gous, using the field C in place of R. 


Suppose G C SL(¢,R), and G is defined over F’, so there is some sub- 
set QO of Flaii,...,2¢¢], such that G° = Var(Q)°. For each place o 
of F, let 


O° 


G? = Varr, (o(Q)) . 
Then G’, the Galois conjugate of G by ¢, is defined over o(F’). 


(5.5.7) Other terminology. Our definition requires places to be infinite 
(or archimedean); that is the reason for the superscript co on S°. Other 
authors also allow places that are finite (or nonarchimedean, or p-adic). 
These additional places are of fundamental importance in number theory, and, 
therefore, in deeper aspects of the theory of arithmetic groups. For example, 
superrigidity at the finite places will play a crucial role in the proof of the Mar- 
gulis Arithmeticity Theorem in Section 16.3. Finite places are also essential 
for the definition of the “S-arithmetic” groups discussed in Appendix C. 


(5.5.8) Proposition. If G is defined over an algebraic number field FC R, 
and O 1s the ring of integers of F’, then there is a finite-index subgroup Go 
of Go, such that 


Go embeds as an arithmetic subgroup of I] G°, 
oESe 


oEse 


Furthermore, if G is simple, then the lattice A(Go) is irreducible. 
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(5.5.9) Warning. By our definition, G? is always connected, since it is the 
identity component of Varr, (o(Q)). If G is assumed to be Zariski closed 
(so it is equal to Var(Q), rather than merely being isogenous to it), then it is 
sometimes more convenient to define G? to be the entire variety Var p, (o( Q)), 
rather than merely the identity component. In particular, that would elim- 
inate the need to pass to a finite-index subgroup Go in the statement of 
Proposition 5.5.8. Taking the best of both worlds, we will usually ignore 
the difference between Go and Go, and pretend that the map A of Propo- 
sition 5.5.8 is defined on all of Go. For example, the statements of Corol- 
lary 5.5.10 and Proposition 5.5.12 below omit the dots that should be in 
A(Go) and ¢(A(Ho)). 


The argument in the last paragraph of the proof of Example 5.5.4 shows 
the following: 


(5.5.10) Corollary. If G° is compact, for some ao € S%, then A(Go) is 
cocompact. 


(5.5.11) Remark. Proposition 5.5.8 is stated only for real groups, but the same 
conclusions hold if 

e GCSL(Z,C), 

e F is an algebraic number field, such that F ¢ R, and 


e G is defined over F’, as an algebraic group over C; that is, there is 
a subset Q of F[x11,..., xe], such that G° = Varc(Q)° (see Nota- 
tion 18.1.3). 


For example, we have the following irreducible arithmetic lattices: 
1) SO(n, Z[i,v/2]) in SO(n, C) x SO(n, C), and 


2) sO(n,Z Mw = v2] ) in SO(n, C) x SO(n, R) x SO(n, R). 


The following converse shows that restriction of scalars is the only way 
to make a group of Z-points that is irreducible. 


(5.5.12) Proposition. Jf [T = Gz is an irreducible lattice in G (and G is 
connected), then there exist 


1) an algebraic number field F', with completion F,, (= R or C), 


2) a connected, simple subgroup H of SL(l,F.), for some £, such that 
H is defined over F' (as an algebraic group over F5,), and 
3) an isogeny 
¢: [|] H° 64, 
oEese 
such that 6(A(Ho)) is commensurable to T. 


Proof. It is easier to work with the algebraically closed field C, instead of R, 
so, to avoid minor complications, let us assume that G C SL(é,C) is defined 
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over Q|?] (as an algebraic group over C), and that T = Gz;,j. This assumption 
results in a loss of generality, but similar ideas apply in general. 

Write G = G, x --- x G,, where each G; is simple. Let H = G;. We 
remark that if r = 1, then the desired conclusion is obvious: let F' = QJil, 
and let @ be the identity map. 

Let © be the Galois group of C over Q|?]. Because G is defined over QJi], 
we have o(G) = G for every o € &. Hence, o must permute the simple factors 
{Gi,..-,Gr}. 

We claim that © acts transitively on {G1,...,G,}. To see this, suppose, 
for example, that r = 5, and that {G,,G 2} is invariant under ©. Then 
A = Gj, X Gp is invariant under %, so A is defined over QJ/i]. Similarly, 
A’ = G3 x G4 X Gs is also defined over Q|i]. Then Az;,) and Arr are lattices 
in A and A’, respectively, so! = Gz & Aza x Arr is reducible. This is a 
contradiction. 

Let 

%,={a0€]|o(G,)=G,} 
be the stabilizer of G;, and let 
F={zeC|o(z)=2z, Voed,} 
be the fixed field of ©;. Because % is transitive on a set of r elements, we 
know that /, is a subgroup of index r in 4, so Galois Theory tells us that 
F is an extension of QJ|] of degree r. 

Since 4; is the Galois group of C over F’, and o(G,) = G, for all o € &4, 
we see that G is defined over F’. 

Let o1,...,0, be coset representatives of ©; in &. Then oj|p,...,0,|rF 


are the r places of F' and, after renumbering, we have G; = 0;(G1). So (with 
H =G)), we have 


I] H° = A%lry...x. AerlF = 01(G1) X---xo0,(G1) = G,x-:-xG, =G. 
oESe 
Let ¢ be the identity map. 
For h € Hr, let A’(h) = TTj=1 oj(h). Then o(A’(h)) = A’(h) for allo € 
u, so A’(h) € Ggyy. In fact, it is not difficult to see that A’( Hr) = Gaya, and 
then one can verify that A’(Ho) © Gzj) =T, so o(A(Ho) is commensurable 
to I. 


(5.5.13) Remark. Although it may not be clear from our proof, the group 
G’ in Corollary 5.5.15 can be chosen to be “absolutely simple”? This means 
that if F C R, then the following three equivalent conditions must be true: 
G’ remains simple over C, g/ ®g C is simple, and G’ is not isogenous to any 
“complexification” (G"’)c. 


Combining Proposition 5.5.12 with Corollary 5.5.10 yields the following 
result. 
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(5.5.14) Corollary. If Gz is an irreducible lattice in G, and G/Gz is not 
cocompact, then G has no compact factors. 


By combining Proposition 5.5.12 with Definition 5.1.19, we see that every 
irreducible arithmetic subgroup can be constructed by using restriction of 
scalars, and then modding out a compact subgroup: 


(5.5.15) Corollary. Jf T is an irreducible, arithmetic lattice in G (and G is 
connected), then there exist 


1) an algebraic number field F', with completion F,, (= R or C), 
2) a connected, simple subgroup G' of SL(¢, Fs), for some &, such that 
G’ is defined over F' (as an algebraic group over F',), and 
3) a continuous surjection 
¢: |] @)7 4G 
oEse 
with compact kernel, 
such that (A(G)) is commensurable to T. 
When G is simple, the restriction of @ to some simple factor of |], 6g. (G’)” 


must be an isogeny, so the conclusion can be stated in the following much sim- 
pler form: 


(5.5.16) Corollary. If T is an arithmetic subgroup of G, and G is simple, 
then there exist 


1) an algebraic number field F', with completion F,, (= R or C), 


2) a connected, simple subgroup G' of SL(¢, Fs), for some £, such that 
G’ is defined over F' (as an algebraic group over F.), and 


3) an isogeny ¢: G' >G, 
such that ¢(G) is commensurable to I. 
However, we should point out that this result is of interest only when 
I is cocompact (or is reducible with at least one cocompact factor). This is 


because there is no need for restriction of scalars when the irreducible lattice 
is not cocompact (see Corollary 5.3.2). 


Exercises for §5.5. 


#1. In the notation of the proof of Example 5.5.3, show, for the Q-form 
A(F?) of R*, that 


am {Ly al] teaser} 


[Hint: Since the F-span of A(F?) is F*, we have End(R*)o C Mataxa(F). 
Thus, for any T € End(R*)9, we may write T = E a with A,B,C,DeE 


End 


os 


#2. 


#3. 


4A, 


#5. 


#6. 
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Mat2x2(F’). Now use the fact that, for all u € F”, we have T(u) = (v,o(v)), 
for some v € F?.] 
In the notation of the proof of Example 5.5.3, let 


Q={Xij42 + 2i42,5, Va jtoVita; | 1 <i,7 <2} 


1 
U ia (#1122, — %12%2,1) — (€3,3%4,4 — vaatsa))} . 


a) Use the conclusion of Exercise 1 to show that each Q € Q is defined 
over Q. 
b) Show that Var(Q)° = SL(2, R) x SL(2,R). 


In the notation of the proof of Example 5.5.3, use Exercise 1 to show 
that Gao?) = A(F); 


Let F', O, 0, A be as in the proof of Example 5.5.3. If G C SL(¢,R), 
and G is defined over F’, show G x G is defined over Q (with respect to 
the Q-form on End(R?*) induced by the Q-form A(F*) on R?*). 

[Hint: For each Q € Q|a1,1,..., 22,2], let us define a corresponding polynomial 


Qt € Qlae+ie41,---,£20,2e] by replacing every occurrence of each variable 
xi,j With Xe+i,e4;. For example, if = 2, then 


(a4 + £1,2%2,1 — 321,1%2,2)* = 13,3 + £3,4%4,3 — 3%3,30%4,4. 
Choose Qo C Q[x1,1,..., 22,0] that defines G as a subgroup of SL(¢,R), and 
let 
Q1={Q+ (Qt), Qo(Q*t) | QE Qo}. 
A natural generalization of Exercise 2 shows that SL(@,R) x SL(¢, R) is defined 


over Q: let Q2 be the corresponding set of Q-polynomials. Now define Q = 
OQ; U Q2.] 


Suppose QO is the ring of integers of an algebraic number field F’. 
a) Show A(Q) is discrete in Qo ego Fo- 
b) Show A(F) is a Q-form of Qo ego Fo- 
c) Show A(Q) is a Z-lattice in A(F)). 
Let 
e Biv, w) = vw) + vow, — V2v3w3, for v,w € R3, 
e G=SO(B)°, 
e G*=GxG’, 
e T= Gaya and 
e T*=A(T). 
Show: 


a) The image of G* in SL(6,R)/ SL(6, R)a(os) is precompact (by us- 
ing the Mahler Compactness Criterion). 
b) The image of G* in SL(6, R)/ SL(6, R) aos) is closed. 
c) G*/T* is compact. 
d) G/T is compact (without using the fact that T is a lattice in G). 
[Hint: This is similar to Proposition 5.3.4.] 
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#7. For any algebraic number field F’, the Q-form A(F*) on @jegx (Fo)! 

induces a natural Q-form on Endg(@,¢5~(F,)"). Show the group 
[ego SL(4, Fa) is defined over Q, with respect to this Q-form. 
[Hint: This is a generalization of Exercise 2. That proof is based on the 
elementary symmetric functions of two variables: P;(ai,a2) = a1 + a2 and 
P2(ai1,a2) = aia2. For the general case, use symmetric functions of d vari- 
ables, where d is the degree of F over Q.| 

#8. Suppose G C SL(@,R), and G is defined over an algebraic number field 
F CR. Show |[,¢5 G is defined over Q, with respect to the Q-form 
on Ende (@,egeo(Fo)*) induced by the Q-form A(F*) on @ jeg (Fo). 


[Hint: This is a generalization of Exercise 4. See the hint to Exercise 7.] 


#9. Show, for all m,n > 1, with m+n > 3, that there exist a lattice [ in 
SO(m,n), and a homomorphism p: T + SO(m-+n), such that p(T) is 
dense in SO(m +7). 


§5.6. Only isotypic groups have irreducible lattices 


Intuitively, the complexification Gc of G is the complex Lie group that 
is obtained from G by replacing real numbers with complex numbers. For 
example, SL(n, R)c = SL(n,C), and SO(n)c = SO(n, C). (See Section 18.1 
for more discussion of this.) 


(5.6.1) Definition. G is isotypic if all of the simple factors of Gc are isoge- 
nous to each other. 


For example, SL(2,R) x SL(3,R) is not isotypic, because SL(2, C) is not 
isogenous to SL(3,C). Similarly, SL(5,R) x SO(2,3) is not isotypic, be- 
cause the complexification of SL(5, R) is SL(5,C), but the complexification of 
SO(2,3) is (isomorphic to) SO(5,C). Therefore, the following consequence 
of the arithmeticity theorem implies that neither SL(2,R) x SL(3,R) nor 
SL(5, R) x SO(2, 3) has an irreducible lattice. 


(5.6.2) Theorem (Margulis). Assume that G has no compact factors. If G 
has an irreducible lattice, then G is isotypic. 


Proof. Suppose [ is an irreducible lattice in G. We may assume that G 
is not simple (otherwise, the desired conclusion is trivially true), so G is 
neither SO(1,n) nor SU(1,n). Therefore, from the Margulis Arithmeticity 
Theorem (5.2.1), we know that [ is arithmetic. Then, since I is irreducible, 
Corollary 5.5.15 implies there is a simple subgroup G’ of some SL(é, R), and 
a compact group K, such that 


e G’ is defined over a number field F’, and 


eee (Gy? 
So the simple factors of G x K are all in { (G’)” | 0 € S™ } (up to isogeny). 
It then follows from Lemma 5.6.5 below that G is isotypic. O 


e G x K is isogenous to [| 
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(5.6.3) Remarks. 

1) We will prove the converse of Theorem 5.6.2 in Proposition 18.7.5 (with- 
out the assumption that G has no compact factors). 

2) By arguing just a bit more carefully, it can be shown that Theo- 
rem 5.6.2 remains valid when the assumption that G has no compact 
factors is replaced with the weaker hypothesis that G is not isogenous 
to SO(1,n) x K or SU(1, n) x K, for any nontrivial, connected compact 
group K (see Exercise 2). 


The following example shows that a nonisotypic group can have irre- 
ducible lattices, so some restriction on G' is necessary in Theorem 5.6.2. 


(5.6.4) Example. SL(2,R) x K& has an irreducible lattice, for any connected, 
compact Lie group K (cf. Exercise 4.9#7). 


We now complete the proof of Theorem 5.6.2: 


(5.6.5) Lemma. Assume G is defined over an algebraic number field F. If o 
is a place of F, and G is simple, then the complezification of G is isogenous 
to the complexification of G°. 


Proof. Extend o to an automorphism o of C. Then a(Gc) = (G7)c, so it 
is clear that Gc is isomorphic to (G’)c. Unfortunately, however, the auto- 
morphism o is not continuous (not even measurable) unless it happens to be 
the usual complex conjugation, so we have only an isomorphism of abstract 
groups, not an isomorphism of Lie groups. Hence, this observation is not a 
proof, although it is suggestive. To give a rigorous proof, it is easier to work 
at the Lie algebra level. 

First, let us make an observation that will also be pointed out in Re- 
mark 18.2.2. If G = SL(n,C), or, more generally, if G is isogenous to a 
complex group G&, then Gc = G x G (because C@p C = C@C). So Ge is 
not simple. However, it can be shown that this is the only situation in which 
the complexification of a simple group fails to be simple: if G is simple, but 
Gc is not simple, then G is isogenous to a complex simple group Ge. There- 
fore, although the complexification of a simple group is not always simple, it 
is always isotypic. 

Now assume, for definiteness, that F' C R (see Exercise 6). Since G is 
defined over fF’, its Lie algebra g is also defined over F’. This means there 
is a basis {v1,...,Un} of g, such that the corresponding structure constants 
{cf ijke=1 all belong to F; recall that the structure constants are defined 
by the formula 

v5, VK] = opr C5 RUE. 

Because G' is isogenous to a group that is defined over Q (see Proposi- 
tion 5.1.5), there is also a basis {u,,...,Un} of g whose structure constants 
are in Q. Write vz = ee akue with each ay, € R, and define 


vy = i G (aj, ue. 
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Then vf,...,u? is a basis of J®pC whose structure constants are {o(c§ ,)} 
These are obviously the structure constants of the Lie algebra g” of G’. 
If o(F) C R, then the R-span of {vf,...,v@} is (isomorphic to) g’, so 
its C-span is g° pC. Since vf,...,v? is also a basis of g ®p C, we conclude 
that (G7)c is isogenous to Gc. 
Finally, if o(f) ¢ R, then the C-span of {vf,...,v%} is (isomorphic 
to) 97, so g ®r C= g’. This implies that (G7 )c is isogenous to Ge. O 


n 
j,k L=V" 


(5.6.6) Remark. The proof of Lemma 5.6.5 used our standing assumption that 
G is semisimple only to show that G is isogenous to a group that is defined 
over Q. See Exercise 3 for an example of a Lie group H, defined over an 
algebraic number field F’ C R, and an embedding o of F in R, such that 
H x H7’ is not isotypic. 


Exercises for §5.6. 


#1. Show, for m,n > 2, that SL(m,R) x SL(n, R) has an irreducible lattice 
if and only ifm =n. 


#2. Suppose G is not isogenous to SO(1,n) x K or SU(1,n)x K, for any non- 
trivial, connected compact group K. Show that if G has an irreducible 
lattice, then G is isotypic. 

[Hint: Use Remark 5.2.3 to modify the proof of Theorem 5.6.2.] 


#3. (optional) For a € C x {0,—1}, let hy be the 7-dimensional, nilpotent 
Lie algebra over C, generated by {x1, 22,23}, such that 


e (ha, £1, 21] — (ha, 2, £2] = [bas £3, 23] = 0, and 
e (x2, 23,21] = a[x1, £2, 3). 
a) Show that [x3, 21,22] = —(1+ @)[%1, x2, x3]. 


b) For h € hq, show that [h,,h,h] = 0 if and only if there exists 
x € {x1,%2,23} and t € C, such that h € ta + [Ba, Bal. 


c) Show Da =Dz iff BE {a, 4,-(1 +a), eee are ite 
d) Show that if the degree of Q(a) over Q is at least 7, then there is 


a place o of Q(a), such that . is not isomorphic to (ha)’. 


#4. (optional) In the notation of Exercise 3, show that if the degree of Q(a) 
over Q is at least 7, then h, is not isomorphic to any Lie algebra that 
is defined over Q. 

#5. (optional) In the notation of Exercise 3, show, for a = V2 — (1/2), that 
hq is isomorphic to a Lie algebra that is defined over Q. 
[Hint: Let y: = 21 +22 and yo = (#1 — 22)/V/2. Show that the Q-subalgebra 
of ha generated by {y1, y2, 73} is a Q-form of ha.| 

#6. Carry out the proof of Lemma 5.6.5 for the case where F' ¢ R. 


[Hint: Write g = g’ @r C and let {u1,...,un} be a basis of g’ with rational 
structure constants. Show that G is isogenous to either G® or (G7 )c.| 
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Notes 


The fact that G is unirational (used in Exercise 5.143) is proved in [4, 
Thm. 18.2, p. 218). 

The Margulis Arithmeticity Theorem (5.2.1) was proved by Margulis [9, 
11] under the assumption that rankgG > 2. (Proofs also appear in [12, 
Thm. A, p. 298] and [16].) Much later, the superrigidity theorems of Corlette 
[6] and Gromov-Schoen [7] extended this to all groups except SO(1,n) and 
SU(1,n). 

Proposition 5.1.5 is a weak version of a theorem of Borel [2]. (A proof 
also appears in [14, Chap. 14].) 

The Commensurability Criterion (5.2.5(1)) is due to Margulis [10]. We 
will see it again in Theorem 16.3.3, and it is proved in [1], [12], and [16]. 

The fact that all noncocompact lattices have unipotent elements (that 
is, the generalization of Theorem 5.3.3 to the nonarithmetic case) is due to 
D. Kazhdan and G. A. Margulis [8] (or see [3] or [14, Cor. 11.13, p. 180}). 

The standard reference on restriction of scalars is [15, $1.3, pp. 4-9]. (A 
discussion can also be found in [13, §2.1.2, pp. 49-50].) 

Proposition 5.5.12 (and Remark 5.5.13) is due to A. Borel and J. Tits [5, 
6.21(ii), p. 113]. 

See [12, Cor. IX.4.5, p. 315] for a proof of Theorem 5.6.2. 
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Chapter 6 


Examples of 
Arithmetic Groups 


§6.1. Arithmetic subgroups of SL(2,R) via orthogonal groups 


SL(2, Z) is the obvious example of an arithmetic subgroup of SL(2,R). Later 
in this section, we will show that (up to commensurability and conjugates) 
it is the only one that is not cocompact (see Proposition 6.1.5). In contrast, 
there are infinitely many cocompact, arithmetic subgroups. They can be 
constructed by several different methods. Perhaps the easiest way is to note 
that SL(2,R) is isogenous to the special orthogonal group SO(2, 1). 


(6.1.1) Notation. In this chapter (and others), we will see many different 
special orthogonal groups over a field F. They can be specified in (at least) 
three different, but equivalent ways: 
1) (Gram matrix) For a symmetric, invertible matrix A € Mat;,.¢(F’), we 
define 
SO(A; F) = {g € SL(n, F) | g7 Ag = A}. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(¢, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. _ http: //creativecommons.org/publicdomain/zero/1.0/ 


Mai St ition of arithmetic subgroup 
(Section 5.1), Godement Criterion (Proposition 5.3.1), and restriction of 
scalars (Section 5.5). 
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This is the approach taken to the definition of SO(m,n) in Exam- 
ple A2.3. 

2) (Bilinear form) A symmetric, bilinear form B on F“ is nondegenerate 
if, for all nonzero v € F*, there exists w € F*, such that B(v,w) # 0. 
We define 

SO(B; F) = {9 € SL(é,F) | B(gv, gw) = B(v,w), Vu,w € F*}. 

3) (Quadratic form) A quadratic form on F* is a homogeneous polyno- 
mial Q(x1,...,a2) of degree 2. It is nondegenerate if the correspond- 
ing bilinear form Bg is nondegenerate, where 


Ba(v, w) = 4(Q(v + w) — Q(v— w)). 
We define 
SO(Q; F) = {9 € SL(¢, F) | Q(gv) = Q(v), Vu € F*}. 
The three approaches give rise to exactly the same groups (see Exercise 1), 
and it is straightforward to translate between them, so we will use whichever 
notation is most convenient in a particular context. 
(6.1.2) Examples. 
1) Fix positive integers a and b, and let 
G = SO(ax? + by? — z?;R) = SO(2, 1). 
If (0,0,0) is the only integer solution of the Diophantine equation 
ax? + by? = z?, then Gz is a cocompact, arithmetic subgroup of G 


(see Proposition 5.3.4). See Exercise 2 for some examples of a and b 
satisfying the hypotheses. 


2) Restriction of scalars (see Section 5.5) allows us to use algebraic number 
fields other than Q. Let 
e F#Q bea totally real algebraic number field (that is, an alge- 
braic number field with no complex places), 
e a,b € F*, such that o(a) and o(b) are negative, for every place 
o #ld, 
e O be the ring of integers of F’, and 
e G = SO(axz? + by? — z?;R) & SO(2, 1). 
Then the group Go is a cocompact, arithmetic subgroup of G (cf. Ex- 
ample 5.5.4, or see Proposition 5.5.8 and Corollary 5.5.10). See Exer- 
cise 3 for an example of F’, a, and b satisfying the hypotheses. 
3) In both (1) and (2), the group G is conjugate to SO(2,1), via the 
diagonal matrix 
g = diag(V/a, Vb, 1). 
Therefore, g~!(Gz)g or g~'(Go)g is a cocompact, arithmetic subgroup 
of SO(2, 1). 


(6.1.3) Remark. For a and 6 as in Example 6.1.2(2), (0,0,0) is the only so- 
lution in O° of the equation ax? + by? = 2? (see Exercise 4). Therefore, 
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Example 6.1.2(1) and Example 6.1.2(2) could fairly easily be combined into 
a single construction, but we separated them to keep them a bit less compli- 
cated. 


(6.1.4) Proposition. The only cocompact, arithmetic subgroups of SO(2, 1) 
are the arithmetic subgroups constructed in Example 6.1.2 (up to commensu- 
rability and conjugates). 

More precisely, any cocompact, arithmetic subgroup of SO(2,1) has a 
conjugate that 1s commensurable to an arithmetic subgroup constructed in 
Example 6.1.2. 


Proof. Let I be a cocompact, arithmetic subgroup of SO(2,1). Ignoring the 
minor technical issue that not all automorphisms are inner (cf. Remark A6.4), 
it suffices to show that there is an automorphism a of SO(2,1), such that 
a([) is commensurable to one of the arithmetic subgroups constructed in 
Example 6.1.2. 


Step 1. There are 


e an algebraic number field F CR, with ring of integers O, 

e asymmetric, bilinear form B(x,y) on F?, and 

e an isomorphism ¢: SO(B;R) > SO(2, 1), 
such that o(SO(B; O)) is commensurable to [. We give two proofs. 

First, we note that this follows from the classification results that will be 
proved in Chapter 18. Namely, a group of the form SO(m, n) does not appear 
in Proposition 18.5.6, and it arises as the right-hand side of two different parts 
of Proposition 18.5.7. However, m+n = 1+2 =8 is odd in our situation, so 
only one of the listings is relevant: Gr must be SO(A; F’), for some algebraic 
number field F C R. This means that [ is commensurable to SO(A; 0), 
where O is the ring of integers of F’. 

Second, let us give a direct proof that does not rely on the results of 
Chapter 18. Because all (irreducible) arithmetic subgroups are obtained by 
restriction of scalars, and G is simple, Corollary 5.5.16 tells us there are 


e an algebraic number field F C R, with ring of integers O, 
e asimple Lie group H C SL(¢,R) that is defined over F’, and 
e an isogeny ¢: H > SO(2, 1), 


such that ¢(Ho) is commensurable to [’. All that remains is to show that we 
may identify Hr with SO(B; F), for some symmetric bilinear form B on F°. 
The Killing form 


K(u,v) = trace((ady u)(adp v)) 
is a symmetric, bilinear form on the Lie algebra §. It is invariant under 


Ad H, so Ady is an isogeny from H to SO(«K;R). Pretending that Ady is 
an isomorphism, not just an isogeny, we may identify H with SO(«;R). Note 
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that «(hr, 6) C F, so, by identifying hr with F°, we may think of & as a 
bilinear form on F?. 


Step 2. We may assume that B(x,x) = ax? + ba} — x2 for some a,b € FT. 
By choosing an orthogonal basis that diagonalizes the form, we may assume 
B(a2, x) = ax} +br3+cxrz. Since SO(B; R) » SO(2, 1), we know that +B(z, x) 
has signature (2,1). So we may assume a,b, —c € F'*. Dividing by c (which 
does not change the orthogonal group) yields the desired form. 


Step 3. F is totally real, and both a(a) and a(b) are negative, for all places o # 
Id. Since A(Go) is an irreducible lattice in [],¢g.. G® (see Proposition 5.5.8), 
but the projection to the first factor, namely G, is , which is discrete, we 
know that G? is compact, for allo #4 Id. This implies G7 © SO(3), so F, = R, 
and the three real numbers o(a), o(b), and o(—1) all have the same sign. 


Step 4. B is anisotropic over F’. Since Go is cocompact, it has no nontrivial 
unipotent elements (see Corollary 4.4.4). Therefore B(x,x) # 0, for every 
nonzero x € F® (see Exercise 5). O 


(6.1.5) Proposition. SL(2,Z) is the only noncocompact, arithmetic sub- 
group V of SL(2,R) (up to commensurability and conjugates). 


Proof. Let us consider the isogenous group SO(2, 1), instead of SL(2, R). 


Step 1. There are 

e asymmetric, bilinear form B(x,y) on Q?, and 

e an isogeny @: SO(B;R) > SO(2, 1), 
such that $(SO(B; Z)) is commensurable to T. Since [ is not cocompact, 
there is an isogeny ¢: G + SO(2,1), such that G is defined over Q and $(Gz) 


is commensurable to I’ (see Corollary 5.3.2). The argument in Steps 1 and 3 
of the proof of Proposition 6.1.4 shows that we may assume G = SO(B;R). 


Step 2. We may assume B(x, x) = x? +23 — x3. Because [ is not cocompact, 
we know that B is isotropic over F' (see Proposition 5.3.4). So there is some 
nonzero u € F?, such that B(u, u) = 0. Choose v € F?, such that B(u, v) 4 0. 
By adding a scalar multiple of u to v, we may assume B(v,v) = 0. Now choose 
a nonzero w € F°? that is orthogonal to both u and v. After multiplying B 
and u by appropriate scalars, we may assume B(w, w) = 2B(u,v) = 1. Then 
B has the desired form with respect to the basis w,u + v,u— v. O 


(6.1.6) Remark. As a source of counterexamples, it is useful to remember that 
SL(2, Z) contains a free subgroup of finite index (see Exercise 4.945 or 4.9#6). 
This implies that (finitely generated) nonabelian free groups are lattices in 
SL(2, R). 
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Exercises for §6.1. 


#1 


#2. 


#3. 


4A, 


5. 


. Show: 


a) If Q is nondegenerate quadratic form on F“, and Bg is defined 
as in Notation 6.1.1, then Bg is a bilinear form, and we have 
SO(Bo: F) = B(Q; F). 

b) If B is a nondegenerate bilinear form on F‘, and we define Q(x) = 
B(ax,x), then Q(x) is a quadratic form, and B = Bg. 

c) If Ais asymmetric, invertible matrix in Mate,¢(F’), and we define 
B(v,w) = v? Au, then B is a nondegenerate bilinear form, and 
SO(B; F) = SO(A; F). 

d) If B is a nondegenerate bilinear form on F*, and {é1,...,€e} is the 
standard basis of F’, then the matrix A = (B(¢;,¢j)) is invertible 
and symmetric, and we have SO(A; F’) = SO(B; F). 


Suppose p is a prime, such that 2? + y? = 0 (mod p) has only the 
trivial solution x = y = 0 (mod p). (For example, p could be 3.) Show 
that (0,0,0) is the only integer solution of the Diophantine equation 
px? + py? = 27. 
Let F = Qlv2, v3], and a=b= V2+ /3—3. Show 
a) F is a totally real extension of Q, 
b) a is positive, and 
c) o(a) is negative, for every place o # Id. 


If a and b are elements of an algebraic number field F’, and there is a 
real place o of F’, such that o(a) and o(b) are negative, show (0, 0,0) 
is the only solution in F° of the equation ax? + by? = z?. 

In Step 4 of the proof of Proposition 6.1.4, verify the assertion that 
B(x,x) #0, for every nonzero x € F®. 

[Hint: If B(a,x) = 0 for some nonzero x, then, after a change of basis, 


B(a,x) is a scalar multiple of the form #123 + £3, which is invariant under 
the unipotent transformation x71 > x1 — 2x%2 — 13, 42 +4 2 +. 13, 13 4 23.] 


§6.2. Arithmetic subgroups of SL(2,R) via quaternion algebras 


In the preceding section, we constructed the cocompact, arithmetic subgroups 
of SL(2, R) from orthogonal groups. As an alternative approach, we will now 
explain what quaternion algebras are, and how they can be used to construct 
those same arithmetic subgroups. In later sections (and later chapters), the 
use of quaternion algebras will sometimes be necessary, not an alternative 
approach. 
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(6.2.1) Definitions. 


1) For any field F’, and any nonzero a,b € F, the corresponding quater- 
nion algebra over F is the ring 


Ag? = {p+qitrj+sk|p.a,r,s € F}, 


where 
e addition is defined in the obvious way, and 
e multiplication is determined by the relations 


P=a, j=), ij=k=—-ji, 


together with the requirement that every element of F' is in the 
center of D. (Note that k? = k-k = (—ji)(ij) = —aj? = —ab.) 
2) The reduced norm of x=p+qit+trj+skeH ae is 
Nea 7 p? — aq’? — br? + abs? € F, 
where © = p—qi—rj—sk is the conjugate of x. (Note that Ty = 77.) 


(6.2.2) Example. 
1) We have H,’’ ' = H. 
2 


t7a,t7b sb : 
We have H,,°" ° = H®” for any nonzero a, b,t € F' (see Exercise 1). 


F 
4) We have Nrea(gh) = Nrea(g) « Nrea(h) for g,h € HY”. 


) 
3) We have E eb es Mat2,2(F), for any nonzero a,b € F (see Exercise 2). 
) 


(6.2.3) Lemma. We have E fale = Matex2(C), for all a,b € C, and 


a,-1 ,, ) Matex2(R) ifa>0, 
an H ifa <0. 


Proof. This follows from the observations in Example 6.2.2. LO 


(6.2.4) Proposition. Fix positive integers a and b, and let 
G = SL(1,HR") = {g € Hg” | Nrea(g) = 1}. 


Then: 
1) G=SL(2,R), 
2) Gz= SL(1, Hy”) is an arithmetic subgroup of G, and 
3) the following are equivalent: 
(a) Gz is cocompact in G. 
(b) (0,0,0,0) is the only integer solution (p,q,r,s) of the Diophantine 
equation 


w* — ax? — by” + abz” = 0. 


(c) Every nonzero element of nS” has a multiplicative inverse (so sl 
is a “division algebra”). 
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Proof. (1) Define an R-linear bijection ¢: He — Matex2(R) by ¢(1) = Id, 


MOV ecu OR |p ale COA ao: 
It is straightforward to check that ¢ preserves multiplication, so @ is a ring 
isomorphism. 

Forg=pt+qt+rj+ske He”, we have 

det ($(9)) = (p + aVa)(p — qV/a) — (r + 8a) (br — bs v/a) 
= p* — aq? — br? + abs? 
= Nrea(g)- 
Therefore, (G) = SL(2, R). 

(2) For g € G, define T,: He” + Hg? by T,(v) = gv. Then Ty, is R- 
linear. For y € a, we have T,( Hy”) c HG if and only if y € ae So 
Gz=GnNH eg is an arithmetic subgroup of G. 

(3c = 3a) We prove the contrapositive. Suppose Gz is not cocompact. 


Then the Godement Criterion (5.3.1) tells us that it has a nontrivial unipotent 
element y. So 1 is an eigenvalue of 7; that is, there is some nonzero v € 


H Ne such that T,(v) = v. By definition of T,, this means yu = v. Hence 
(y —1)v = 0. Since y 4 1 and v ¥ 0, this implies v is a zero divisor, so it 
certainly does not have a multiplicative inverse. 


(3a = 3c) We prove the contrapositive. Suppose EH oF is not a division 
algebra. Then sib ~ Mat2,2(Q) (see Exercise 3). This implies SL(1, oP) 
SL(2,Z) is not cocompact. (It has nontrivial unipotent elements.) 

(3b = 3c) See Exercise 4. 


The following can be proved similarly (see Exercise 5). 


(6.2.5) Proposition. Let 
e F be a totally real algebraic number field (with F 4 Q), 
e O be the ring of integers of F, 


e a,b € O, such that a and b are positive, but o(a) and a(b) are negative, 
for every place o € 1d, and 


e G=SL(1, Hg"). 
Then: 
1) G=SL(2,R), and 


2) Go= SL(1, H pa is a cocompact, arithmetic subgroup of G. 


(6.2.6) Proposition. Every cocompact, arithmetic subgroup of SL(2,R) ap- 
pears in either Proposition 6.2.4 or 6.2.5 (up to commensurability and con- 
jugates). 
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Proof. This can be proved directly, but we will instead derive it as a corollary 
of Proposition 6.1.4. For each arithmetic subgroup I’ of SO(2, 1), constructed 
in Example 6.1.2, we find an isogeny ¢: SL(2,R) — SO(2,1), such that 6(T’) 
is commensurable to an arithmetic subgroup constructed in Proposition 6.2.4 
or 6.2.5. 

(1) First, let us show that every arithmetic subgroup of type 6.1.2(1) 
appears in (6.2.4). Given positive integers a and b, such that (0,0,0) is the 
only rational solution of the equation ax? + by? = z?, let 

G = SL(1, Hg”) © SL(2,R). 
One can show that (0,0,0,0) is the only rational solution of the equation 
w? — ax? — by? + abz? = 0 (see Exercise 6), so Gz is a cocompact, arithmetic 
subgroup of G (see Proposition 6.2.4). 
As a subspace of H Be the Lie algebra g of G is 
g= {ve He” | Rev=0} 
(see Exercise 7). For g € Gand v € g, we have (Adg g)(v) = gug~*, 80 Nrea |g 


is a quadratic form on g that is invariant under AdG'p. For v = xit+yj+zk € 
g, we have 


1 


Nicdlo) = —ax* — by? + abz?. 

After the change of variables x 4 by and y + az, this becomes —ab(ax? + 
by? — 2”), which is a scalar multiple of the quadratic form in 6.1.2(1). There- 
fore, after identifying g with R® by an appropriate choice of basis, the arith- 
metic subgroup constructed in Example 6.1.2(1) (for the given values of a 
and b) is commensurable to Adg Gz. 

(2) Similarly, every arithmetic subgroup of type 6.1.2(2) appears in (6.2.5) 
(see Exercise 8). oO 


Exercises for §6.2. 


2 2 
~ ,b 
#1. Show H;,°" 7 = H®’, for any nonzero u,v € F. 


[Hint: An isomorphism is given by 1+> 1, it ui, J vj, kt uvk.] 


#2. Show qa ~ Mato ,2(F), for any field F’', and any a,b € F. 
[Hint: See the proof of Proposition 6.2.4(1).] 


#3. Show that if the ring Hg” is not a division algebra, then it is isomorphic 


to Mat2x2(Q). 

[Hint: This follows from Wedderburn’s Theorem (6.8.5), but can also be 
proved directly: if x is not invertible, then xy = 0 for some y, so the left ideal 
generated by x is a 2-dimensional subspace on which Hg” acts faithfully. ] 


#4. Show that every nonzero element of HI has a multiplicative inverse if 
and only if the reduced norm of every nonzero element is nonzero. 


[Hint: If Nrea(x) 4 0, then multiply the conjugate of x by an element of F' to 
obtain a multiplicative inverse of x. If Nrea(a) = 0, then x is a zero divisor.] 
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#5. For G, F, O, a, and 6 as in Proposition 6.2.5, show: 
a) G=SL(2,R), 
b) Go is an arithmetic subgroup of G, 
c) ifg eH a with Nrea(g) = 0, then g = 0, and 
d) Go is cocompact in G. 


#6. Let a and 6b be nonzero elements of a field F’. Show t 
nonzero solution of the equation w? — ax? — by? + abz? 
is a nonzero solution of the equation w? 
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hat if there is a 
= 0, then there 


az? = by? =0. 


[Hint: By assumption, there is a nonzero element g of He”, such that 


Nrea(g) = 0. There is some nonzero a € F' + F%, such that 
of ag is zero.| 


#7. For a,b € R, the set 


G={g € Hg” | Nrea(g) = 1} 
is a submanifold of He? Show that the tangent space 
{v € H®” | Rev = 0}. 
[Hint: T,G is the kernel of the derivative d(Nrea)1-] 


#8. Carry out Part (2) of the proof of Proposition 6.2.6. 


the k-component 


TG is 


§6.3. Arithmetic subgroups of SL(2,R) via unitary groups 


Unitary groups provide yet another construction of the cocompact, arithmetic 


subgroups of SL(2,R). In later sections (and later chapters 


), they will join 


quaternion algebras as another essential tool, not an alternative approach. 


In fact, unitary groups can be applied in two different 


ways. The sim- 


pler of the two approaches is based on the fact that SL(2,R) is isomorphic 
to SU(1,1) (see Exercise 2). (This is very similar to the construction in Sec- 


tion 6.1 that is based on the fact that SL(2,R) is isogeno 


us to SO(2, 1).) 


However, the required isogeny has no higher-dimensional analogue, so this 


method will not provide any lattices in SL(n,R) when n > 2 


The following method is much more important, because it will be used 


in later sections to construct arithmetic subgroups of SL(n,] 
just n = 2. 
(6.3.1) Example. Let 
eabeQ, 
e L= QI al CR, 
e O be the ring of integers of L (so O = Zh/a)), 
e 7 denote the nontrivial element of Gal(L/Q), 
e A=diag(b,—-1) = [8 °], and 


R) for all n, not 


e Go = SU(A,7;O) = {9g € SL(2,O) | r(g7) Ag = A} C SL(2,R). 


122 6. EXAMPLES OF ARITHMETIC GROUPS 


If x = (0,0) is the only solution in L? of the equation r(x?) Ax = 0, then Go 
is a cocompact, arithmetic subgroup of SL(2, R). 


Proof. It is not at all difficult to verify that Go is commensurable to an 
arithmetic group constructed from a quaternion algebra in Proposition 6.2.4 
(see Exercise 1), but a direct proof is more instructive. 

To see that Go is an arithmetic subgroup, we apply restriction of scalars. 
The Galois automorphism 7: L — L is Qlinear. Therefore, if we think of 
L as a (2-dimensional) vector space over Q, then 7 is a polynomial with 
Q coefficients (with respect to any basis of L over Q). Since matrix multi- 
plication and transpose are also defined by polynomial functions, this implies 
that if we write g = X + /aY, where X,Y € Matox2(Q), then the equation 
t(g') Ag = A is a system of polynomial equations with Q-coefficients, in 
terms of the matrix entries of X and Y. Therefore, it determines a group 
that is defined over Q. More precisely, letting G = SL(2, R), define: 

e A: L > L? by A(s) = (s, A7(s)), so L = A(O) is a Zlattice in R’, 

and 


© ¢: G> Gx G by $(9) = (9, (97) ~*). 

The import of the above argument is that $(G) is defined over Q, with respect 
to the Qform A(L) of R?. Since it is not difficult to verify that Go = 
p'(p(G)awo)), we see that Go is an arithmetic subgroup of G. 

If Go is not cocompact, then it has a nontrivial unipotent element wu, so 
there exist nonzero x,y € L?, such that uz = x and uy = x+y. Define 
B: L? x L? + L by B(x1,22) = T(x} ) Axa. Since u € Go, the definition of 
Go implies 

B(a,y) = B(uz, uy) = B(az,x+y) = Bia, x) + Bla, y). 
Therefore B(x,x) = 0. By assumption, this contradicts the fact that x # 
0. O 


(6.3.2) Example. The preceding example can be modified, much as in Propo- 
sition 6.2.5, to obtain all of the other cocompact lattices in SL(2,R). Namely, 
replace Q with a totally real number field F' 4 Q, and let: 


e a,b € Ft, such that o(a) < 0 and o(b) < 0, for all nonidentity places 
of F, 


e L=F\/a| CR, and 
e O,7, A, and Go be defined as in Example 6.3.1. 
Then Go is a cocompact, arithmetic subgroup of SL(2, R). 


Proof. Let Or be the ring of integers of F'. From the second paragraph of 
the proof of Example 6.3.1 (with F in the place of Q) we see that Go is the 
Or-points of a certain F-form Gp of SL(2,R). Then restriction of scalars 
(5.5.8) implies that A(Go) is an arithmetic subgroup of |] G’. 


oESe 
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For any nonidentity place o of F’, we have a(a) < 0, so 
Lg = Fe|V/o(a)| =C. 
Then, since o(b) and —1 are both negative, we have 
G? = SU(o(A), tc; C) = SU(diag(o(b), —1), 7c; C) & SU(2) is compact. 
Therefore, all factors of [[,¢g.. G® other than G are compact, so we can mod 
them out, to conclude that Go is an arithmetic subgroup of the group G = 


SL(2, R) (cf. Definition 5.1.19). Furthermore, the existence of compact factors 
implies that the arithmetic subgroup is cocompact (see Corollary 5.5.10). 


Exercises for §6.3. 


#1. Let a,b € Z*, let o: sles — Mat2,2(R) be as in the proof of Propo- 
sition 6.2.4, let O = Zh/a], and let Go be as in Example 6.3.1. Show 


¢(SL(1,H%")) = Go. 


#2. Let 
ea,bEQ, 
° L=Q[y-a], 
e O be the ring of integers of L (so O = Z| /—al i. 
e 7 denote complex conjugation (the only nontrivial element of 


Gal(C/R), and also of Gal(L/Q)), 
e A=diag(b,—1) = [5 ©], and 
e G=SU(A,7;C) ={g € SL(2,C) | r(g7) Ag = A} & SU(1, 1). 
Show that if « = (0,0) is the only solution in L? of the equation 
t(a")Aax = 0, then Go is a cocompact, arithmetic subgroup of G. 


§6.4. Arithmetic subgroups of SO(1, 7) 
(6.4.1) Proposition. Let 
@ Q1,...,Gn be positive integers, and 


e G = SO(a2 — ax? — --- — anz?;R) = SO(1,n). 


n? 


Ifn>4, then Gz is an arithmetic subgroup of G that is not cocompact. 


Proof. Since a1,...,@, > 0 it is obvious that G & SO(1,n). Also, since 
Q1,---,4, € Q, it is clear that G is defined over Q, so Gz is an arithmetic 
subgroup of G. 

Since we are assuming n > 4, a theorem of Number Theory (called 
Meyer’s Theorem) tells us that the equation a,x? +--+ + an22 = x2 has 
a nontrivial integral solution. (This is related to, but more difficult than, the 
fact that every integer is a sum of four squares.) Therefore, G'z is noncocom- 


pact. 
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In most cases, the above construction is exhaustive: 


(6.4.2) Proposition (see Corollary 18.6.3). If n ¢ {3,7}, then the arith- 
metic subgroups constructed in Proposition 6.4.1 are the only noncocompact, 
arithmetic subgroups of SO(1,n) (up to commensurability and conjugates). 


(6.4.3) Remarks. 
1) The case n = 7 is genuinely exceptional: there exist some exotic arith- 
metic subgroups of SO(1,7) (see Remark 18.5.10). 
2) The groups SO(1, 2) and SO(1, 3) are isogenous to SL(2, R) and SL(2, C), 
respectively. Therefore, Propositions 6.1.5 and 6.2.6 describe all of the 


arithmetic subgroups of SO(1, 2). Similar constructions yield the arith- 
metic subgroups of SL(2,C) + SO(1, 3). 


Cocompact arithmetic subgroups of SO(1, 7) can be constructed by using 
an algebraic extension of Q, much as in Example 6.1.2: 


(6.4.4) Proposition. Let 
e F be an algebraic number field that is totally real, 
e O be the ring of integers of F, 
@ Q1,...,4n € O, such that 
° each a; 1s positive, and 
o each a(a;) is negative, for every place o # 1d, and 
e G=SO(ax2 — ax? — --- — anx?;R) & SO(1,n). 


n? 


Then Go is a cocompact, arithmetic subgroup of G. 
This construction is exhaustive when n is even: 


(6.4.5) Proposition (see Corollary 18.6.1). [fn is even, then the arithmetic 
subgroups constructed in Proposition 6.4.4 are the only cocompact, arithmetic 
subgroups of SO(1,n) (up to commensurability and conjugates). 


(6.4.6) Remark. Theoretically, it is easy to tell whether two choices of a1,..., Qn 
give essentially the same arithmetic subgroup (see Exercise 3). 


When n is odd, we can construct additional arithmetic subgroups of 
SO(1,n) by using quaternion algebras. This requires a definition: 
(6.4.7) Definition. Suppose He? is a quaternion algebra over a field F’. 
1) Define 7,.: He? > He? by 
Tr LG + 411+ %oj3 + x3k) =%9+ 211 -—X%oj + x3k. 


This is the reversion anti-involution of E ae (cf. Exercise A2#2). 
2) For A € GL(m, HG) with 7,(A7) = A, let 
SU(A, 73H’) = {g € SL(m, HR”) | r,(g7)Ag = A}. 


Now, here is the main idea of the construction: 
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(6.4.8) Proposition. Let 
e abe QX {0}, with a > 0, 


© Q1,---,4m be invertible elements of H 
each £, 


e A= diag(a1,...,@m) € GL(m, Hg”), 

¢ G=SU(A,7,;Hg’), and 

e O be a Z-lattice in HY: 
Then: 

1) G=SO(p,q), for some p and q with p+q=2m, and 

2) SU(A,7,;O) is an arithmetic subgroup of G. 


a,b 


, such that T,(ae) = ag, for 


, such that O is also a subring. 


Proof. To make things a bit easier, let us assume b < 0 (see Exercise 8). 
Exercise 5 provides an isomorphism ¢: He — Mate x2(R), such that: 

e d(7,(z)) = ¢(z)*, for allxe sae and 

e (x) is symmetric, for all x € H oa such that 7,(a) = x. 


Then ¢(A) is symmetric, and G is isomorphic to SO2,,(¢(A)) (see Exercise 6). 
This establishes (1). 


As a vector space over R, (Hy’ 


b\m 


is isomorphic to R*”. With this 


’ 


identification, and considering (I ea as a vector space over H = via scalar 
multiplication on the right, we have 


GL(m, Hy”) es {! € GL(4m, R) 


g(at) =(g@ )t for all 
Ze (Hy) and ¢ € He? {° 


Since Hy” is dense in E ae we may restrict t to belong to Hg”. This implies 
G is defined over Q, with respect to the Q-form (H or of ( “a hae For this 
Q-form, we have Gz = SU(A,7,;O). This establishes (2). 


(6.4.9) Remark. Since p+ q = 2m must be even, the preceding proposition 
cannot yield any arithmetic subgroups of SO(1,n) unless 1+ is even, which 
means that n is odd. 


Proposition 6.4.8 yields an arithmetic subgroup of some SO(p,q), but 
not necessarily a subgroup of SO(1, 7). Obtaining a particular value of p re- 
quires us to prescribe the number of positive eigenvalues of the symmetric ma- 
trix ¢(A) that appears in the proof. Since ¢(A) is made from ¢(a1),..., (am), 
this is achieved by calculating the number €q,,(az) of positive eigenvalues of 
each ¢(a¢); the formula is in Notation 6.4.10 below. 

However, as in Proposition 6.4.1, Meyer’s Theorem implies that arith- 
metic subgroups obtained in this way are never cocompact (unless G is com- 
pact or m < 2). To construct cocompact lattices, restriction of scalars is 
applied, as usual: choose an extension field F' of Q, and arrange for G° to be 
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compact at all but one place. The outcome of these considerations is stated 
in Proposition 6.4.11 below. 
(6.4.10) Notation (cf. Exercises 9 and 10). Suppose 


e a and b are nonzero elements of R, such that either a or b is positive, 
and 


e x is an invertible element of H ae such that 7.(¢) =a. 


Write « = p+qi+sk, for some p,q,s € R. For convenience, let 


Nap(x) = 2 & = p* — aq? + abs”, 
and note that Na»(x) 4 0 (since x is invertible). Define 
1 if b Nao(x) > 0, 
2 if bNa»(x) < 0, and 
Ea,b(&) = ; b <0 and p > 0, or 
either 
b>0Oand (a+ 1)q+ (a—1)sVb > 0, 
O otherwise. 
(6.4.11) Proposition. Let 
e F be a totally real algebraic number field (such that F 4 Q), 
e a and b be nonzero elements of F', such that, for each place o of F, 
either o(a) or o(b) is positive, 


@ 41,.-.,Am € HY, such that 
© T,(ae) = ag for each £, 
o a(ag) is invertible, for each €, and each place o, 
6 327-4 2abl(ae) = 1, and 
OST ye etayelh) (o(ae)) € {0,2m} for each place o # Id, 


e O be a Z-lattice in HY”, such that O is also a subring, and 
eG= SU (diag(a1, eet yh are clea he 

Then: 
1) G=SO(1,2m — 1)°, and 


2) Go is a cocompact, arithmetic subgroup of G. 


(6.4.12) Proposition. Jf n ¢ {3,7}, then the arithmetic subgroups con- 
structed in Propositions 6.4.4 and 6.4.11 are the only cocompact, arithmetic 
subgroups of SO(n,1) (up to commensurability and conjugates). 


Remark 18.5.10 briefly explains the need to assume n # 7. 
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Exercises for §6.4. 


#1. 


#2. 


#3. 


4A, 


$9. 


#16. 


#7. 


Use restriction of scalars (see Section 5.5) to construct cocompact arith- 
metic subgroups SO(m,n) for all m and n. 


Suppose G is an irreducible subgroup of GL(¢,C). (This means there 
is no nonzero, proper, G-invariant subspace of C’.) Show that if By 
and By are (nonzero) G-invariant quadratic forms on C’, then there 
exists A € C, such that By = ABo. 


[Hint: Let Ai and Az be the symmetric matrices that represent B; and Bo, 
and write A» = Ai L. For any g € G, we have AiL = g’ AiLg = Ai(g~'Lg).] 


Let 
e F, O, a1,...,@,, and G be as in Proposition 6.4.4, 
e T =h-!Goh, where n = diag(1, \/a1,...,./@n), and 
e F’, O',a},...,a),, G’, I’, and h’ be defined similarly. 


Show g~!Ig is commensurable to I’, for some g € O(1,7), if and only 
if there exists \ € F* and g’ € GL(n+ 1, F), such that 


(g')? diag(—1,a1,...,@n) 9’ = Adiag(—1,a/,...,a/,). 


ee 0) 
[Hint: (=) For g' = h’gh~', we have (g')~'SO(B;O)g’ C SO(B’;R), so 
the Borel Density Theorem implies (g’)~'SO(B;R)g’ C SO(B’;R). Apply 
Exercise 2 with G = (g’)~* SO(B;R)q’.] 


Let F, O, a1,...,@n, a4,...,a),, , and I’ be as in Exercise 3. 
Show that if n is odd, and there exists g € O(1, 7), such that gl'g~4 
is commensurable to I’ then 
a eee Qn 
adn 6 (pexye 


/ 
a, °° + An 


[Hint: The discriminant of a quadratic form B(x) on F”*' is defined to 
be the determinant of the Gram matrix of B, with respect to any basis B 
of F”*!. This is not uniquely determined by B, but show that it is well- 
defined up to multiplication by a nonzero square in F'*.] 


Suppose a and 6 are real numbers, such that a > 0 and b < 0. Show 
that there is an isomorphism ¢: H oe — Mate x2(R), such that: 
a) $(tr(x)) = ¢(2)", for all x € H ao and 


b) $(a) is symmetric, for all x € He”, such that 7,(x) = x. 


<— so Jia © | Ie | 0 |b| 
[Hint: Let (i) = | Oi =e and $(j) = | _ I ; | 
Assume the notation of the proof of Proposition 6.4.8. Show G is iso- 
morphic to SOam (d(A)). 


[Hint: Apply the isomorphism ¢ to both sides of the equation 7,(g’)Ag = A.] 


Suppose a and b are nonzero real numbers, such that b > 0, and let 
w= 9 AR Show there is an isomorphism ¢: He? — Mat2.2(R), such 
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a) o(7,(a)) = wo(x)? w, for all 2 ¢ Hy’, and 
b) w@(az) is symmetric, for all « € H os such that 7,(a”) = x. 


— «FO? x2 ive IG 
[Hint: Let $(i) = L i and ¢(j) = | 0 i | 
#8. Prove Proposition 6.4.8 under the additional assumption that b > 0. 
[Hint: Use Exercise 7 and show G ~ SO2m(diag(w¢(a1),...,w(am)).-] 
#9. In the situation of Exercise 5, show that ¢ can be chosen so that if 
x=p+qi+rj+sk is an invertible element of H nee and 7,(x) = x (so 
r = 0), then the number of positive eigenvalues of (x) is 
1 if Na,o(x) <0, 
2 if Nap(x) >0 and p> 0, 


0 otherwise. 


[Hint: Since both eigenvalues of ¢(x) are real (and nonzero), the number of 
positive eigenvalues is determined by the determinant and trace.] 


#10. In the situation of Exercise 7, show that ¢ can be chosen so that if 
x=p+qi+rj-+sk is an invertible element of H i and 7,(xz) = x (so 
r = 0), then the number of positive eigenvalues of w(x) is 
1 if Na,o(2) > 0, 
2 if Na»o(x) <0 and (a+ 1)q+ (a—1)svb > 0, 


0 otherwise. 


[Hint: See the hint to Exercise 9.] 


§6.5. Some nonarithmetic lattices in SO(1, 7) 


Section 6.4 describes algebraic methods to construct all of the arithmetic 
lattices in SO(1,n) (when n 4 7). We now present a geometric method 
that is sometimes able to produce a new lattice by combining two known 
lattices. The result is often nonarithmetic. We assume some familiarity with 
hyperbolic geometry. 


§6.5(i). Hyperbolic manifolds. For geometric purposes, it is more 
convenient to consider the locally symmetric space [\S)", instead of the lat- 
tice [. 

(6.5.1) Definition. A connected, Riemannian n-manifold M is hyperbolic 
if 
1) M is locally isometric to 9)” (that is, each point of M has a neighbor- 
hood that is isometric to an open set in $9”), 
2) M is complete, and 
3) M is orientable. 
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(6.5.2) Other terminology. Many authors do not require M to be com- 
plete or orientable. Our requirement (1) is equivalent to the assertion that 
M has constant sectional curvature —1; some authors relax this to require 
the sectional curvature to be a negative constant, but do not require it to be 
normalized to —1. 


(6.5.3) Notation. Let PO(1,n) = O(1,n)/{+Id}. 
Note that: 
e PO(1,7) is isogenous to SO(1,7n), 
e PO(1,n) = Isom("”), and 
e PO(1,n) has two connected components (one component consists of 
orientation-preserving isometries of §)”, and the other consists of orientation- 
reversing isometries). 


The following observation is easy to prove (see Exercise 1). 


(6.5.4) Proposition. A connected Riemannian manifold M of finite volume 
is hyperbolic if and only if there is a torsion-free lattice T in PO(1,n)°, such 
that M is isometric to T\S”. 


§6.5(ii). Hybrid manifolds and totally geodesic hypersurfaces. 
We wish to combine two (arithmetic) hyperbolic manifolds M; and Mp2 into 
a single hyperbolic manifold. The idea is that we will choose closed hypersur- 
faces Cy and C2 of M; and Mo, respectively, such that C is isometric to C2. 
Let M. F be the manifold with boundary that results from cutting M; open, 
by slicing along C; (see Figure 6.5A and Exercise 3). 


Figure 6.5A. Cutting open a manifold by slicing along a 
closed hypersurface (dashed) results in a manifold with 
boundary. 


The boundary of Mj (namely, two copies of C) is isometric to the bound- 
ary of M3 (namely, two copies of C2) (see Exercise 3). So we may glue WM; and 
Ms together, by identifying OM; with OM§ (see Figure 6.5B), as described in 
the following well-known proposition. 

(6.5.5) Proposition. Suppose 
e M; and M3 are connected n-manifolds with boundary, and 


e f': OM{ > OMS is any homeomorphism. 
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| 

<< NN ee 
| 

7 — My Up M3 


Figure 6.5B. Gluing Mj to M3 along their boundaries re- 
sults in a manifold without boundary. 


Define a topological space Mj Ur M3, by gluing M; to Mé along their bound- 
aries: 
e let M, U M3 be the disjoint union of Mj and M3, 
e define an equivalence relation on Mi U Ms by specifying that we have 
mw f'(m), for everym  € OM, and 
e let Mi Up MS = (Mj U M3)/~ be the quotient of Mj U M3 by this 
equivalence relation. 
Then Mj Ur M5 is an n-manifold (without boundary). 


(6.5.6) Corollary. Suppose 
e M, and Mz are connected, orientable n-manifolds, 
e C; is a closed (n — 1)-submanifold of M,;, and 
e f: Ci > C2 is any homeomorphism. 
Define Mi# M2 = M; Uy M3, where 
° M; is the manifold with boundary that is obtained by slicing M; open 
along C;, and 
e f’: OM{ + OM3 is defined by f'(c,k) = (f(0),k), under a natural 
identification of OM; with C; x {1,2}. 
Then Mi # Mo is a (connected) n-manifold (without boundary). 
Furthermore, 
1) Mi# M2 is compact if and only if both My and Mz are compact, and 
2) Mi#t¢ Me is connected if and only if either My \ Cy or M2 \ C2 is 
connected. 


(6.5.7) Other terminology. Gromoy and Piatetski-Shapiro [5] call the man- 
ifold M,# M2 a hybrid of M, and Mo, and they call this construction in- 
terbreeding. 


Unfortunately, gluing two Riemannian manifolds together does not always 
result in a Riemannian manifold (in any natural way), even if the gluing map f 
is an isometry from 0M; to 0M3. 


(6.5.8) Example. Let Mj = Mj be the closed unit disk in R?, and let 
f: OM, — OMs be the identity map. Then M; Uy M3 is homeomorphic 


6.5. SOME NONARITHMETIC LATTICES IN SO(1, n) 131 


to the 2-sphere $7. The Riemannian metrics on Mj and Mé are flat, so the 
resulting Riemannian metric on S? would also be flat. However, there is no 
flat Riemannian metric on S?. (This follows, for example, from the Gauss- 
Bonnet Theorem.) 


We can eliminate this problem by putting a restriction on the hypersur- 
face. 


(6.5.9) Definition. Let M be a hyperbolic n-manifold. A totally geodesic 
hypersurface in M is a (closed, nonempty) connected submanifold C of M, 
such that, for each point c of C, there are 

e a neighborhood U of cin M, 

e apoint z in 9”-1={vEH" |v, =0}, 

e a neighborhood V of x in 9”, and 

e a Riemannian isometry g: U > V, such that g(UNC)=VAH""1. 


(6.5.10) Remark. If C is a totally geodesic hypersurface in a hyperbolic n- 
manifold of finite volume, then there are 


e a lattice [ in PO(1,n), and 
e an isometry f: M3 T\H", 
such that f(C) is the image of §"~+ in T\H”. 


(6.5.11) Proposition. Jf 
e M, and M2 are hyperbolic n-manifolds, 
e C; is a totally geodesic hypersurface in M;, 
e f: Cy > Cs, is a Riemannian isometry, and 
e M, and Mz have finite volume, 


then M,# ¢M2 is a hyperbolic n-manifold of finite volume. 


Proof. The main issue is to show that each point of 0M; has a neighbor- 
hood U in Mj Uy M3, such that U is isometric to an open subset of 9”. This 
is not difficult (see Exercise 4). 

We have vol(M,# M2) = vol(M1) + vol(M2) < oo. 

If M1#7M2 is compact, then it is obviously complete. More generally, 
since Mj and Mj are complete, and their union is all of Mj Us M3, it seems 
rather obvious that every Cauchy sequence in Mj Uy M3 has a convergent 
subsequence. Hence, it seems to be more-or-less obvious that Mj Ur M3 is 
complete. 

Unfortunately, if M17 M2 is not compact, then there is a technical dif- 
ficulty arising from the possibility that, theoretically, the Riemannian isome- 
try f may not be an isometry with respect to the topological metrics that C1 
and Cy inherit as submanifolds of M, and Mg, respectively. We will ignore 
this issue. 
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The following lemma describes how we will construct the totally geodesic 
hypersurface C;. 


(6.5.12) Lemma. Suppose 

e I is a torsion-free lattice in PO(1,n)°, 

e C is the image of 9"~+ in T\H”, 

e 7:8" +H” is the reflection across "1, so 

T0095 U15-0545.Un) = (00,015.44 23 VNR) 
e PM PO(1,n — 1) ts a lattice in PO(1,n — 1), and 
e IT is contained in a torsion-free lattice I’ of PO(1,n)°, such that I’ is 
normalized by T. 

Then C is a totally geodesic hypersurface in T\H", and C has finite volume 
(as an (n — 1)-manifold). 


Proof. It is clear, from the definition of C’, that we need only show C’ is a 
(closed, embedded) submanifold of [\”. 

Let To = {7 ET | 7(H"71) = Hr-1}. (Then PN PO(1,n — 1) is a 
subgroup of index at most two in T9.) The natural map 

6: To\H" 1 3 T\H” 

is proper (cf. Exercise 4.4#43), so C, being the image of ¢, is closed. 

Because ¢ is obviously an immersion (and is a proper map), all that 
remains is to show that ¢ is injective. This follows from the assumption on I” 
(see Exercise 5). Oo 


§6.5(iii). Construction of nonarithmetic lattices. The following 
theorem is the key to the construction of nonarithmetic lattices. We postpone 
the proof until later in the section (see Subsection 6.5(iv) and Exercise 11). 


(6.5.13) Definition. A hyperbolic n-manifold of finite volume is arithmetic 
if the corresponding lattice [ in PO(1, 7) (see Proposition 6.5.4) is arithmetic. 
(Note that T is well-defined, up to conjugacy (see Exercise 2), so this definition 
is independent of the choice of I.) 
(6.5.14) Theorem. Suppose 

e M, and M2 are hyperbolic n-manifolds, 

e C; is a totally geodesic hypersurface in M;, 


f: Ci > C2 is a Riemannian isometry, 


M, and Mg have finite volume (as n-manifolds), 


Cy and C2 have finite volume (as (n — 1)-manifolds), and 
each of My \ Cy and Mz ~ C2 is connected. 


If the hyperbolic manifold M,## Mg is arithmetic, then M,# M2 is commen- 
surable to M,; that 1s, there are 
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1) a finite cover M of My Ur Mz, and 
2) a finite cover M, of M,, 
such that M is isometric to M,. 


(6.5.15) Corollary. In the situation of Theorem 6.5.14, if the hyperbolic man- 
fold My #f Mz 1s arithmetic, then M, is commensurable to Mo. 


Proof. From Theorem 6.5.14, we know that M,# M2 is commensurable 
to M,. By interchanging M; and Mg, we see that M,# Mp2 is also commen- 
surable to My. By transitivity, | is commensurable to M9. 


(6.5.16) Corollary. There exist nonarithmetic lattices Tepce and Pron in 
SO(1,n), such that Tepet is cocompact, and Tyon is not cocompact. 


Proof. We construct only [non. (See Exercise 6 for the construction of Depet, 
which is similar.) 
Define quadratic forms B,(x) and Bo(x) on Q”*! by 


By (x) = 09 — 2] —@3-— +++ 2, 1-2, 
and 

Bo (a) = 24 —a7 = a3 a4 = We 
Let 


e I; © SO(B;;Z), 
e Ty &h-!SO(Bo;Z)h, where 
h = diag(1,1,...,1, V2) € GL(n+1,R), 


e Mj =Tj\9", 

e C; be the image of §”~! in M;, and 

e fT; =; NSO(1,n—1). 

Then Proposition 6.4.4 tells us that [; and Pz are noncocompact (arithmetic) 
lattices in SO(1,n). By passing to finite-index subgroups, we may assume T'; 
and TI. are torsion free (see Theorem 4.8.2). Therefore, M, and My, are 
hyperbolic n-manifolds of finite volume (see Proposition 6.5.4). 

Because T; ~ SO(1,n — 1;Z) is a lattice in SO(1,n — 1), and SO(B;; Z) 
is normalized by the involution 7 of Lemma 6.5.12, we know C; is a totally 
geodesic hypersurface in M; that has finite volume (see Lemma 6.5.12). 

Let us assume that M, \ C; and M2 \ C2 are connected. (See Exercise 8 
for a way around this issue, or note that this hypothesis can be achieved by 
passing to finite covers of M, and Mp.) 

We know that ise & Ts (since both groups are commensurable to SO(1, n— 
1;Z)). By taking a little bit of care in the choice of Ty and lz, we may arrange 


~ 


that Ty = rs (see Exercise 9). Then 
CS hes a = T\5" 2 = Co, 
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so there is an isometry f: Cy > Co. 

If n is odd, then M, is not commensurable to Mo (see Exercise 6.474), 
so Corollary 6.5.15 implies that M,#,Mo is not arithmetic; therefore, the 
corresponding lattice [yon is not arithmetic (see Definition 6.5.13). When 
n is even, an additional argument is needed; see Exercise 10. LC] 


§6.5(iv). Proof of Theorem 6.5.14. Let us recall the following lemma, 
which was proved in Exercise 5.177. 


(6.5.17) Lemma. [f 
e G has no compact factors, 
e I; and TY. are arithmetic lattices in G, and 
e 1, N12 is Zariski dense in G, 
then Ty is commensurable to To. 
(6.5.18) Definition. Let MW’ be a Riemmanian n-manifold with boundary. 
We say that M’ is a hyperbolic manifold with totally geodesic boundary 
if 
1) M’ is complete, 
2) each point of M’\ 0M’ has a neighborhood that is isometric to an open 
set in 9)”, and 
3) for each point p of OM’, there are 
a neighborhood U of p in M’, 
a point 7 in H"-1={v EH” |v, =0}, 
a neighborhood V of x in 9”, and 
an isometry g: U + V™, where 


VT ={vEV]u>0}. 
(Note that g(U NOM") =Vn9""1,) 


The following is a generalization of Theorem 6.5.14 (see Exercise 11). 


(6.5.19) Theorem. Suppose 
e M, and Mo are hyperbolic n-manifolds, 


° M; 1s a connected, n-dimensional submanifold of Mj; with totally geo- 
desic boundary, 


f': OM; > OMS is an isometry, 


M, and Mg have finite volume (as n-manifolds), 


° OM; has only finitely many components, and 

e OM, and OM35 have finite volume (as (n — 1)-manifolds). 
If the hyperbolic manifold M{ Ur Mg is arithmetic, then Mj Us M5 is com- 
mensurable to My. 


Proof. 
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e Let M= M Ufr M3. 

e Write M =[\S”, for some torsion-free lattice T in PO(1, 7). 

e Let ¢: 9” > M be the resulting covering map. 

e Let B = ¢-1(0M{). Because M} has totally geodesic boundary, we 

know that B is a union of disjoint hyperplanes. (That is, each compo- 
nent of B is of the form g(”~'), for some g € O(1,n).) 

e Let V be the closure of some connected component of 9” \ B that 

contains a point of é~!(M{). 

e Let 

M={yeT|wW=V}={7 ETL | interior(yV NV) 40} 
(see Exercise 12), so Mj = @(V) = I’\V. 

By definition, V is an intersection of half-spaces, so it is (hyperbolically) 
convex; hence, it is simply connected. Therefore, V is the universal cover 
of Mj, and I” can be identified with the fundamental group of M;. 

Since Mi C Mj, we may define [;, ¢1, B1,Vi,T) as above, but with M, 
in the place of M. From the uniqueness of the universal cover of MM), we 
know that there is an isometry 7: V — V,, and an isomorphism y,.: TP’ 31, 
such that w(yv) = w.(y) v(v), for all y € I’ and v € V. Since w extends 
to an isometry of §)”, we may assume (after replacing Ty with w~!'T,w) that 
V = V, and vy, = Id. Hence IY = 1 CIMT. It suffices to show (after 
replacing I’ by a conjugate subgroup) that the Zariski closure of I’ contains 
PO(1,7)°, for then Lemma 6.5.17 implies [ is commensurable to [. 


Claim. We may assume that the Zariski closure of 1’ contains PO(1,n)°. 
We may assume §)”~! is one of the connected components of OV. Since 0M{ 
has finite volume, this means that 
IY NSO(1,n — 1) is a lattice in PO(1,n — 1). (6.5.20) 
Let I’ be the Zariski closure of I’. From (6.5.20) and the Borel Density Theo- 
rem (4.5.6), we know that I’ contains PO(1,n—1)°. Then, since PO(1,n—1)° 
is a maximal connected subgroup of PO(1,n) (see Exercise 13), we may as- 
sume that F7” = PO(1,n —1)°. (Otherwise, the claim holds.) Because I” 
has finite index in I’ (see A4.6), this implies that PO(1,n — 1)° contains a 
finite-index subgroup of I’. In fact, 
{7 € I" |yH =H } has finite index in I’, (6.5.21) 
for every connected component H of OV. et 


This will lead to a contradiction. 


Case 1. Assume OV is connected. We may assume OV = §)"~!. Then, by 
passing to a finite-index subgroup, we may assume that I’ Cc PO(1,n — 1) 
(see 6.5.21). Define g € Isom(§)”) by 


g(v1, V2,---,Un) = (—V1, v2, ---,Un)- 
Then 
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e g centralizes I’, and 
e 9” =VUg(V). 
Since I’\V & My; has finite volume, we know that I’\g(V) also has finite 
volume. Therefore 
I’\ 8)" = (I’\V) U (T’"\9(V)) 
has finite volume, so I” is a lattice in PO(1,n). But this contradicts the Borel 
Density Theorem (4.5.6) (since [’ C PO(1,n — 1)). 


Case 2. Assume OV is not connected. Let H; and H2 be two distinct con- 
nected components of OV. Replacing I’ by a finite-index subgroup, let us 
assume that each of H, and Hg is invariant under I” (see 6.5.21). 

To simplify the argument, let us assume that OM; is compact, rather 
than merely that it has finite volume. (See Exercise 14 for the general case.) 
Therefore, IH; is compact, so there is a compact subset C' of Hy, such that 
I’C =H 1- Let 

6 = min{ dist(c, H2)|cEC}>0. 
Because I’ acts by isometries, we have 6 = dist(H,, Hz). Now, since §)” is 
negatively curved, there is a unique point p in H,, such that dist(p, H2) = 6. 
The uniqueness implies that p is fixed by every element of I’. Since T' acts 
freely on 9” (recall that it is a group of deck transformations), we conclude 
that I’ is trivial. This contradicts the fact that I'’\H, is compact. (Note that 
H, = 9"~' is not compact.) O 


Exercises for §6.5. 


#1. Prove Proposition 6.5.4. 


#2. Show that if [, and P2 are torsion-free lattices in PO(1,7), such that 
T',\” is isometric to [2\”, then T; is conjugate to Ta. 
(Hint: Any isometry ¢: T1\9”" > T2\S” lifts to an isometry of §)”.| 


#3. Let C be a closed, connected hypersurface in an orientable Riemannian 
manifold M, and let M’ be the manifold with boundary that results 
from cutting M open, by slicing along C’. Show: 

a) If C is orientable, then the boundary of M is two copies of C. 

b) If C is not orientable, then the boundary is the orientable double 
cover of C. 

c) If C is isometric to a closed, connected hypersurface Cp in an 
orientable Riemannian manifold Mo, and Mj is the manifold with 
boundary that results from cutting Mo open, by slicing along Co, 
then the boundary of M’ is isometric to the boundary of M5. 


#4. For M,, Mo, and f as in Exercise 4, show that if p € OM{, then p has 
a neighborhood U in M; Uy M3, such that U is isometric to an open 
subset of )”. 


#5. 


#6. 


#7. 


HH, 


9, 


410. 


#11. 


412. 


#13. 
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[Hint: Find a ball V around a point « in §"~', and isometries g1: U1 > Vt 
and g2: U2 + V~, where U; is a neighborhood of p in Mj, with glam = 


(920 Alam] 
For ¢: T9\”"~! > T\8”, as defined in the proof of Lemma 6.5.12, 
show that ¢ is injective. 


[Hint: Suppose yx = y, for some y €T and x,y € 9”~*. Then y~'ryr is an 
element of I’ that fixes x, so it is trivial. Hence, the fixed-point set of T is 
y-invariant. | 


Assume n is odd, and construct a cocompact, nonarithmetic lattice [ 
in SO(1,n). 
[Hint: Let F = Q[V/2], define Bi(x) = V2x3 — 2] — 3 —----— 2? _, —- 2? 


and Bo() = /2x3 — 27 — 23 —--- — 22_, — 322.4,, and use the proof of 
Corollary 6.5.16.] 


In the notation of the proof of Corollary 6.5.16, assume that My, \ Cy 
and Mz C2 are not connected; let Mj; be the closure of a component of 
M;\C;. Show that if f’: C; + C; is any isometry (and n is odd), then 
Mj Ur M3 is a nonarithmetic hyperbolic n-manifold of finite volume. 


Eliminate the assumption that M, \ C, and M2 x C2 are connected 
from the proof of Corollary 6.5.16. 

[Hint: Define Bs(x) = 2% — x7 — «3 —---— 22_, — 3x2. If M; \ C; has the 
same number of components as M, \ Cy (and j 4 k), then either Exercise 7 
or the proof of Corollary 6.5.16 applies. | 


For By(x) and Bo(x) as in the proof of Corollary 6.5.16, show that 
there are finite-index subgroups ['; and [2 of SO(B;; Z) and SO(B2; Z), 
respectively, such that 

a) T, and [2 are torsion free, and 

b) Ty a) SO(1,n = 1) = i a) SO(1,n _ 1). 
[Hint: Let [; = AN SO(B;;Z), where A is a torsion-free subgroup of finite 
index in SL(n + 1, Z).] 


In the notation of the proof of Corollary 6.5.16, show that if n is even 
(and n > 4), then [yon is not arithmetic. 

[Hint: If Tnon is arithmetic, then its intersection with SO(1,n — 1) is arith- 
metic in SO(1,n — 1), and n — 1 is odd.] 

Derive Theorem 6.5.14 as a corollary of Theorem 6.5.19. 

[Hint: Apply Theorem 6.5.19 to M; = Mj#;, Mj, where re fi: C; > C; is the 


identity map. Note that M; is a double cover of M;, so M; is commensurable 
to M;.| 


For I’ and V as in the proof of Theorem 6.5.19, let V be the interior 
of V, and show, for each y €T, that if yVNV £9, then yVV =V. 


Show that if H is a connected subgroup of PO(1,n) that contains 
PO(1,n —1)°, then H = PO(1,n—-1)°. 
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#14. Eliminate the assumption that 0M; is compact from Case 2 of the proof 
of Theorem 6.5.19. 
[Hint: The original proof applies unless dist(H1, H2) = 0, which would mean 
that H; and H2 intersect at infinity. This intersection is a single point, and 
it is invariant under I’, which contradicts the Zariski density of I’.] 


§6.6. Noncocompact arithmetic subgroups of SL(3, R) 


We saw in Proposition 6.1.5 that SL(2, Z) is essentially the only noncocom- 
pact, arithmetic subgroup of SL(2,R). So it may be surprising that SL(3, Z) 
is not the only one in SL(3, R). 


(6.6.1) Proposition. Let 
e L be a real quadratic extension of Q, so L= Ql] , for some square- 
free positive integer r > 2, 
e o be the nontrivial Galois automorphism of L, 


e o be the automorphism of Mat3x3(L) induced by applying o to each 
entry of a matrix, 


e [ =SU(J3,0;Z]/r]) = {9 € SL(3, Z|V7]) | (97) J3 9 = Js }. 
Then: 
1) T is an arithmetic subgroup of SL(3,R), 


2) T is not cocompact, and 


3) no conjugate of T is commensurable to SL(3, Z). 


Proof. (1) This is a special case of Proposition 18.5.7(6), but we provide a 
concrete, explicit proof (using the methods of Sections 5.4 and 5.5). 
Define 


e A: L? > R® by A(v) = (v, Jz o(v)), 

e Vg = A(L), 

e £L=A(Z{yr]?), and 

e p: SL(38,R) > SL(6,R) by 
p(A)(v, w) = (Av, (AT)~!w) for v,w € R®. 


Then 

e Vo is a Q form of R® (cf. Exercise 5.575(b)), 
L is a Z-lattice in Vo (cf. Exercise 5.54£5(c)), 
e pis a homomorphism, 


p(SL(3,R)) is defined over Q (with respect to the Q-form Vg) (see 
(6.6.2) below), and 


T= {g € SL(3,R) | p(g)L = L} (cf. Exercise 5.5#1). 
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Hence, Proposition 5.4.5(1) (together with Theorem 5.1.11) implies that T is 
an arithmetic subgroup of SL(3, R). 
Now let us show that 
p(SL(3, R)) is defined over Q. (6.6.2) 
This can be verified directly, by finding appropriate Q-polynomials, but let 
us, instead, show that p(SL(3, Rg is dense in p(SL(3, R)). 

Define U, as in (6.6.3) below, but allowing a,b,c to range over all of Q, 
instead of only 2Z. Then p(U;)Vo C Vo (see Exercise 1), so we have p(U;) C 
p(SL(3, Ro: Furthermore, U, is dense in 

1 x x 
= ) 1 | . 
001 
Similarly, there is a dense subgroup U2 of U?, with p(U2) C p(SL(3,R)), 
(see Exercise 2). Since (U,U?) = SL(3,R), we know that (U;,U2) is dense in 
SL(3, R), so p(SL(3, R)g is dense in p(SL(3,R)). Therefore p(SL(3,R)) is 
defined over Q (see 5.1.8). 
(2) By calculation, one may verify, directly from the definition of I’, that 
the subgroup 
E atb/r —(a? —rb?)/2+ oy 
Up = i 1 —a+ by/r a, b,c € 2Z (6.6.3) 
1 | 


0 0 


is contained in I’. Then, since every element of Up is unipotent, it is obvious 
that [ has nontrivial unipotent elements. So the Godement Criterion (5.3.1) 
implies that G/T is not compact. 

(3) We sketch a proof. Choose an element w € Z[yr], such that 
o(w) = 1/w. Then diag(w,1,w') is a hyperbolic element of I that nor- 
malizes the maximal unipotent subgroup Up. On the other hand, it is easy 
to see that if UZ is any subgroup of SL(3,Z) that is commensurable to the 
maximal unipotent subgroup Uz, then Uz has finite index in Ngr:3,z)(Uz)- 
Since all of the maximal unipotent subgroups of SL(3,Z) are conjugate (up 
to commensurability) under SL(3,Q), this implies that no conjugate of T is 
commensurable to SL(3, Z). 

Here is a more complete argument that is based on the notion of Q-rank, 
which will be explained in Chapter 9. Define a nondegenerate o-Hermitian 
form B(x, y) on L® by B(x, y) = o(x7) J3y. Then v = (1,0,0) is an isotropic 
vector for B (i.e., B(v,v) = 0). On the other hand, because B is nonde- 
generate, the dimension of the orthogonal complement of any subspace is 
equal to the codimension of the subspace. Since L® is 3-dimensional, this 
implies there is no 2-dimensional subspace that consists entirely of isotropic 
vectors. Therefore, rankgI = 1 (cf. Example 9.1.5(2)). However, we have 
rankg SL(3, Z) = 2 (cf. Example 9.1.5(1)). Two lattices with different Q-ranks 
cannot be conjugate. (They cannot even be abstractly commensurable.) 
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(6.6.4) Remarks. 


1) 


2) 


From Proposition 6.6.1(3), we know that none of the arithmetic sub- 
groups in Proposition 6.6.1 are conjugate to a subgroup that is com- 
mensurable to SL(3, Z). 

Indeed, let X = SL(3, R)/SO(3) be the symmetric space associated 
to SL(3,R). Theorem 2.2.8 implies that if [ is one of the arithmetic 
subgroups constructed in Proposition 6.6.1, then the geometry of the 
locally symmetric space ['\X is very different from that of SL(3,Z)\X. 
Namely, ['\X is only mildly noncompact: it merely has cusps, which 
means that its asymptotic cone is a union of finitely many rays. In 
contrast, the asymptotic cone of SL(3, Z)\X is a 2-complex, not just a 
union of rays. Even from a distance, [\X and SL(3,Z)\X look com- 
pletely different. 


Different values of r always give essentially different arithmetic sub- 
groups (see Exercise 4), but this is not so obvious. 


The classification results in Chapter 18 imply that these are the only 
arithmetic subgroups of SL(3,R) that are not cocompact: 


(6.6.5) Proposition (see Proposition 18.6.4). SL(3,Z) and the arithmetic 
subgroups constructed in Proposition 6.6.1 are the only noncocompact arith- 
metic subgroups of SL(3,R) (up to commensurability and conjugates). 


Exercises for §6.6. 


oH. 


#2. 


#3. 


HA. 


For U,, p, and Vg as in the proof of Proposition 6.6.1(1), show that 
p(U1) Vo C Va. 


In the notation of the proof of Proposition 6.6.1, find a dense sub- 


group U2 of 
eto] 
* 10], 
x «1 


Ve 
Assume the notation of the proof of Proposition 6.6.1, and let G = 
p(SL(3,R)). 
a) Show that G is quasisplit. That is, show that some Borel sub- 
group of G is defined over Q. 
b) Show that every proper parabolic Q-subgroup of G is a Borel sub- 
group of G. 


[Hint: Let B be the group of upper-triangular matrices in SL(3,R). Then 
B is a Borel subgroup of SL(3,R), and p(B) is defined over Q.] 


such that p(U2) C p(SL(3, 


A 


Let [, and I’ be noncocompact arithmetic subgroups of SL(3, R) that 
correspond to two different values of r, say ry; and rg. Show that I, is 
not commensurable to any conjugate of T9. 


[Hint: There is a diagonal matrix in IT; whose trace is not in Z| /ra| | 
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§6.7. Cocompact arithmetic subgroups of SL(3, R) 


Example 6.3.2 used unitary groups over a totally real extension to construct 
cocompact, arithmetic subgroups of SL(2,R). The same technique can be 
applied to SL(3, R): 


(6.7.1) Proposition. Let 
e F be a totally real algebraic number field, such that F 4 Q, 
e t,a,b¢ F, such that 
o t,a,b >0, but 
o a(t),a(a),o(b) < 0 for every place o ¥ Id, 
o L=FiVil, 


e 7 be the Galois automorphism of L over F, 


e O be the ring of integers of L, and 
eT= SU (diag(a, b,-1),7;O). 
Then T is a cocompact, arithmetic subgroup of SL(3,R). 


Here is a specific example: 


(6.7.2) Corollary. Let 
et= v2, 
F = Q{t] = Qlv2], 
b= F[ vil = OW], 
e 7 be the Galois automorphism of L over F, 
© O= ZW 2| be the ring of integers of L, and 
e 1 = SU(Id3x3,7; 0). 
Then T is a cocompact, arithmetic subgroup of SL(3,R). 


(6.7.3) Remark. It is necessary to assume F’ 4 Q in Proposition 6.7.1 (in other 
words, there is no analogue of Example 6.3.1 for SL(3,R)), because unitary 
groups over Q yield only noncompact lattices in SL(3,R) (as in Proposi- 
tion 6.6.1), not cocompact ones (see Exercise 1). 


Here is a quite different construction (not using unitary groups) that 
yields additional examples of cocompact, arithmetic subgroups. See Exam- 
ple 6.7.6 for explicit examples of L and p that satisfy the hypotheses. 


(6.7.4) Proposition. Let 
e L be a cubic, Galois extension of Q (that is, a Galois extension of Q, 
such that |L : Q| = 3), 
e o be a generator of Gal(L/Q) (note that Gal(L/Q), being of order 3, is 
cyclic), 
e O be the ring of integers of L, 
epeZ*, 
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e ¢: L? + Mat3x3(L) be given by 


x y od 
d(z,y,z)= | po(z) a(x) oly) |, (6.7.5) 
por(y) por(z) (x) 
and 
e T={y7e (0%) | dety = 1}. 


Then: 
1) [T is an arithmetic subgroup of SL(3,R). 
2) T is cocompact if and only if p 4 ta(t) o7(t), for all t € L. 


Proof. (1) It is easy to see that: 

e LCR (see Exercise 2). 

e ¢(L?) and ¢(O0%) are subrings of Mat3,3(L) (even though ¢ is not a 

ring homomorphism). 

e ¢(L°) is a Q- form of Mat3x3(R). 

e ¢(O%) is a Zlattice in o(L?). 

e If we define p: Mat3x3(R) + Endg(Mat33(R)) by p(g)(v) = gu, then 

p(SL(3, R)) is defined over Q (with respect to the Q-form ¢(L*) (see Ex- 
ercise 3)). 
e T= {g €SL(3,R) | 9 d(O*) = 9(O%) }. 
So I is an arithmetic subgroup of SL(3,R) (see Proposition 5.4.5(1)). 

(2 <=) If SL(3, R)/T is not compact, then the Godement Criterion (5.3.1) 
tells us there is a nontrivial unipotent element u in I’. This means 1 is an 
eigenvalue of u (indeed, it is the only eigenvalue of u), so there is some nonzero 
v € R? with uv = v. Hence (u—1)v = 0. Since u £ Id and v ¥ 0, we conclude 
that ¢(L?) has a nonzero element that is not invertible. 

Hence, letting D = ¢(L°), it suffices to show that every nonzero element 
of D is invertible. (That is, D is a “division algebra”) For convenience, define 
N: LQ by N(t) = to(t) o?(t). (In Algebraic Number Theory, N is called 
the “norm” from L to Q.) We know that p 4 N(t), for all t € L. It is easy to 
see that N(tit2) = N(ti) N(t2). 

Note that if xyz = 0, but (x,y,z) 4 (0,0,0), then ¢(z,y, z) is invert- 
ible. For example, if z = 0, then det d(z,y,z) = N(x) +pN(y). Since 
p# N(-2/y) = —N(a)/N(y) (assuming y 4 0), we see that det (xz, y, z) £0, 
as desired. The other cases are similar. 

For any x,y,z € L, with z 40, we have 

a (1. -—*..,0) (2,9, 2) = 00, *, *) 
is invertible, so $(x, y, z) is invertible. 

(2 =) If p = to(t)o7(t), for some t € L, then ¢(L?) = Mat3x3(Q) 
(see Exercise 5). From this, it is easy clear that ¢(O?) contains nonidentity 
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unipotent matrices. Since the determinant of any unipotent matrix is 1, these 
unipotents belong to [. Therefore [’ is not cocompact. 


(6.7.6) Example. Let 

e ¢ = 2cos(27/7), 

e L=Q¢], and 

e p be any prime that is congruent to either 3 or 5, modulo 7. 
Then 

1) Lis a cubic, Galois extension of Q, and 

2) p#to(t)o7(t), for all t € L, and any generator o of Gal(L/Q). 
To see this, let w = e?*’/7 be a primitive 7" root of unity, so ¢ = w+w®. Now 
it is well known that the Galois group of Q|w] is cyclic of order 6, generated 
by T(w) = w® (see Proposition B3.4). So the fixed field L of 7? is a cyclic 
extension of degree 6/2 = 3. 

Now suppose ta(t) o?(t) = p, for some t € L*. Clearing denominators, 

we have sa(s) o7(s) = pm, where 

emeZ*, 

es=atdwt+w®*) +cw+w®)?, with a,b,c € Z and pt gcd(a, b,c). 
Replacing w with the variable x, we obtain integral polynomials s;(x), s2(x), 
and s3(), such that 

Zy|a] 

(x6 + a> +-+-41) 
This implies that 2° +a°+---+1 is not irreducible in Z,[z]. This contradicts 
the choice of p (see Exercise 6). 


(6.7.7) Remark. The famous Kronecker-Weber Theorem tells us that if D is 
a Galois extension of Q, with abelian Galois group, then L is contained in 
an extension obtained by adjoining an nth root of unity to Q (for some n). 
(Warning: this does not hold for abelian extensions of algebraic number fields 
other than Q.) As a very special case, this implies that all of the cubic, Galois 
extension fields L of Q can be constructed quite explicitly, in the manner of 
Example 6.7.6: 


e Choose n € ZT, such that y(n) is divisible by 3 (where 
p(n) =#{k|1<k <n, ged(k,n) =1} 


is the Euler y-function). 
Qni/n 


81(X)S2(x)s3(4) = pm = 0 in 


e Let w=e be a primitive n* root of unity. 

e Let H be any subgroup of index 3 in the multiplicative group (Z,,)* of 
units modulo n. 

e Let € = yey w* = Vey 008(20k/n). 

e Let L = Q¢]. 
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We have now seen that cocompact arithmetic subgroups of SL(3, R) can be 
constructed by two different methods: some are constructed by using unitary 
groups (as in Proposition 6.7.1) and others are constructed by using “division 
algebras” D = ¢(L°) (as in Proposition 6.8.8). We will see in the following 
section that these two methods can be combined: some cocompact arith- 
metic subgroups are constructed by using both unitary groups and division 
algebras. The classification results in Section 18.4 show that all cocompact 
arithmetic subgroups of SL(3,R) can be obtained from these methods, using 
either unitary groups, division algebras, or a combination of the two (perhaps 
also combined with restriction of scalars). The same is true for the cocompact 
arithmetic subgroups of any SL(n,R), with n > 3. 


Exercises for §6.7. 


#1. 


#2. 
#3. 


4A, 


#69. 


Assume the situation of Proposition 6.7.1, except that F = Q. More 
precisely, let 
a, b,c € Q (all nonzero), 
L be a real quadratic extension of Q, 
T be the Galois automorphism of L over Q, 
O be the ring of integers of L, 
A = diag(a, b,c), and 
e [ =SU(A,7; 0), so T is an arithmetic subgroup of SL(3, R). 
Show that I is not cocompact. 


[Hint: The equation r(a* ) Aa = 0 for x € L? can be considered as an equation 
in 6 variables over Q, so the Number Theory fact mentioned in the proof of 
Proposition 6.4.1 implies it has a nontrivial solution.| 


Let L be a Galois extension of Q, with |Z: Q| odd. Show LCR. 


Assume the notation of the proof of Proposition 6.7.4. For h € L?, 
define T;, € Endy (Mat3,3(R)) by T;,(v) = $(h) v. 
a) Show that $(h) € Endg (Mats 3( Ro where the Q-form is in- 
duced by the Q-form ¢(L?) of Mat3,3(R). 
[Hint: Show $(h) 6(L°) C (L?).] 
b) Show that p(Mat3,3(R)) is the centralizer of {T), | h € L* }. 
c) Show that p(SL(3,R)) is defined over Q. 


Aa 


In the notation of Proposition 6.7.4, show that if p = to(t) o?(t), then 
the element o(1, 1/t, 1/(to(t))) of ¢(L?) is not invertible. 


In the notation of Proposition 6.7.4, show that if p = ta(t)0o7(t), for 
some t € L, then 6(L?) & Mat3x3(Q). 

[Hint: { (a, to(a), to(t)o?(a)) } is a 3-dimensional, $(L)-invariant Q-subspace 
of LF] 
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#6. Let p and q be distinct primes, and 
f(v) Sat +---4+041, 
Show that f(x) is reducible over Z, if and only if there exists r € 
{1,2,...,q— 2}, such that p" = 1 (mod q). 
[Hint: Let g(x) be an irreducible factor of f(x), and let r = deg g(x) < q—1. 


Then f(x) has a root a in a finite field F' of order p". Since a is an element 
of order g in F'*, we must have q| #F™.] 


§6.8. Arithmetic subgroups of SL(n, R) 


We will briefly explain how the previous results on SL(3, R) can be generalized 
to higher dimensions. (The group SL(2,R) is a special case that does not fit 
into this pattern.) The proofs are similar to those for SL(3, R). 

In Section 6.2, and in the proof of Proposition 6.7.4, we have seen that 
cocompact arithmetic subgroups of SL(n,R) can sometimes be constructed 
by using rings in which every nonzero element has a multiplicative inverse. 
More such “division algebras” will be needed in order to construct all the 
arithmetic subgroups of SL(n, R) with n > 3. 


§6.8(i). Division algebras. 
(6.8.1) Definition. An associative ring D is a division algebra over a field F 
if 
1) D contains F in its center (that is, ef = fax for alla € Dand f € F), 
2) the element 1 € F is the identity element of D, 
3) 


4) every nonzero element of D has a multiplicative inverse. 


D is finite-dimensional as a vector space over F’, and 


Furthermore, it is central over F if the entire center of D is precisely F’. 


(6.8.2) Remarks. 
1) Dis an algebra over F if (1) and (2) hold. 
2) The word division requires (4). We also assume (3), although not all 
authors require this. 


(6.8.3) Other terminology. Division algebras are also called skew fields. 
(6.8.4) Examples. 


1) Any extension field of F' is a division algebra over F’ (but is not usually 
central). 


2) H=H ae "is a central division algebra over R. 


3) More generally, a quaternion algebra H ae is a central division algebra 
over F if and only if Nyeaq(x) 4 0, for every nonzero x € EH oe (see Ex- 


ercise 6.2#4). Note that this is consistent with (2). 
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The following famous theorem shows that division algebras are the build- 
ing blocks of simple algebras: 


(6.8.5) Theorem (Wedderburn’s Theorem). Let A be a finite-dimensional 
algebra over a field F. If A is simple (that is, if A has no nonzero, proper, 
two-sided ideals), then A & Matnxn(D), for some n and some division alge- 
bra D over F. 


Proof. Since A is finite-dimensional, we may let J be a minimal left ideal. 
Then I is a left A-module that is simple (that is, has no nonzero, proper 
submodules). So End4(J) is a division algebra (see Exercise 2); call it D. 

We have JA = A, since JA is a 2-sided ideal and A is simple. Hence, 
the minimality of J implies A = Ia; ®--- @ Ian, for some aj,...,an € A 
(see Exercise 3). 

For A considered as a left A-module, it is easy to see that each element of 
End,4(A) is multiplication on the right by an element of A (see Exercise 4); 
therefore End4(A) = A. On the other hand, it is easy to see that Ja; is 
isomorphic to I as a left A-module (see Exercise 5), so we have 


End (A) = End, (la4 OB: @D Ian) = End,(Z”) 
~ Matnxn(Enda(I)) = Mataxn(D). 
Therefore A =~ Matnxn(D). O 


(6.8.6) Corollary. If D is a central division algebra over F', then we have 
dimpr D = d?, for some d € Z*. (This integer d is called the degree of D 
over F’.) 


Proof. Let F be the algebraic closure of F’. Then (from Wedderburn’s The- 
orem), we see that D ®p F = Mataxa(D’), for some d and some central 
division algebra D’ over F. Since F is algebraically closed, we must have 


D! = F (see Exercise 6), so 


dimr D = dim¢(D @p F) = dimz Mataya(F) = d’. Oo 


In order to produce arithmetic groups from division algebras, the following 
lemma provides an analogue of the ring of integers in an algebraic number 
field F. 


(6.8.7) Lemma. If D is a division algebra over an algebraic number field F, 
then there is a subring Op of D, such that Op is a Z-lattice in D. Any such 
subring is called an order in D. 


Proof. Let {vo,v1,...,Ur} be a basis of D over Q, with vo = 1. Let 
{cf k}j.ke-0 be the structure constants of D with respect to this basis. That 


is, for j,k € {0,...,r}, we have vju~ = d7p~9 c ,ve. There is some nonzero 
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m € Z, such that MCE p € Z, for all j,k,@. Let Op be the Z-span of 
{WO ee NOE Ps 


In the proof of Proposition 6.7.4, we showed that $(L?) is a division 
algebra if p 4 ta(t) a°(t). Conversely, it is known that every division algebra 
of degree 3 arises from the above construction. (In the terminology of ring 
theory, this means that every central division algebra of degree 3 is “cyclic” ) 
Therefore, we can restate the proposition in the following more abstract form. 


(6.8.8) Proposition. Let 


e L be a cubic, Galois extension of Q, 


e D be a central division algebra of degree 3 over Q, such that D con- 
tains L as a subfield, and 


e Op be an order in D (see Lemma 6.8.7). 
Then there is an embedding ¢: D + Mats x3(R), such that 

1) (SL(1, D)) is a Q-form of Mats x3(R), and 

2) $(SL(1, Op)) is a cocompact, arithmetic subgroup of SL(3,R). 

Furthermore, o(SL(1, Op)) is essentially independent of the choice of Op 
or of the embedding ¢. Namely, if Oy and ¢' are some other choices, then 
there is an automorphism a of SL(3,R), such that ad’ (SL(1, O4)) is com- 
mensurable to ¢(SL(1, Op)). 


This generalizes in an obvious way to provide cocompact, arithmetic sub- 
groups of SL(n, R). By replacing SL(1, Op) with the more general SL(m, Op), 
we can also obtain arithmetic subgroups that are not cocompact (if n is not 
prime). 


(6.8.9) Proposition. Let 
e D be a central division algebra of degree d over Q, such that D splits 
over R, 
e meEZ, and 
e Op be Z-lattice in D that is also a subring of D. 
Then o(SL(m, Op)) is an arithmetic subgroup of SL(dm,R), for any embed- 
ding 6: D > Mataxa(R), such that (D) is a Q-form of Mataxa(R). 
It is cocompact if and only if m = 1. 


§6.8(ii). Unitary groups over division algebras. The definition of a 
unitary group is based on the Galois automorphism of a quadratic extension. 
This is a field automorphism of order 2. The following analogue makes it 
possible to define unitary groups over division algebras that are not required 
to be fields. 
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(6.8.10) Definition. Let D be a central division algebra. A map tT: D > D 
is an anti-involution if 7? = Id and 7 is an anti-automorphism; that is, 
T(a@+y) =7(x2)+7(y) and r(ay) = T(y) T(x). (Note that 7 reverses the order 
of the factors in a product.) 


(6.8.11) Other terminology. Some authors call 7 an involution, rather than 
an anti-involution, but, to avoid confusion, our terminology emphasizes the 
fact that 7 is not an automorphism (unless D is commutative). 
(6.8.12) Examples. Let D be a quaternion division algebra. Then: 
1) The map 7,: D > D defined by 
Telat+ bi +c7 +dk) =a— bi —cj — dk 
is an anti-involution. It is called the standard anti-involution of D, 
or the conjugation on D, so T(x) can also be denoted Z. 


2) The map 7,: D > D defined by 
Tr(a+bit+cj+dk)=a+bi-—cj+dk 
is an anti-involution. It is called the reversion on D. 
(6.8.13) Definitions. Let 7 be an anti-involution of a division algebra D 
over F’. 
1) A matrix A € Mat,.,(D) is said to be Hermitian (or, more precisely, 
T-Hermitian) if (AT)? = A. 
2) Given a Hermitian matrix A, we let 
SU(A,7;D) = {g € SL(n, D) | (A7)" Ag = A}. 

This notation makes it possible to state a version of Proposition 6.7.1 
that replaces the quadratic extension L with a larger division algebra. 
(6.8.14) Proposition. Let 

e L be a real quadratic extension of Q, 

e D be a central simple division algebra of degree d over L, 


e 7 be an anti-involution of D, such that T|, is the Galois automorphism 
of L over Q, 


e bi,...,bm € D*, such that 7(b;) = b; for each j, 
e Op be an order in D, and 
e [ = SU(diag(by, b2,...,0m),7; Op). 
Then: 
1) T is an arithmetic subgroup of SL(md,R). 


2) T is cocompact if and only if, for all nonzero x € D™, we have 
(x? ) diag(by, 09; «.+, 0m) a x 0. 
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Additional examples of cocompact arithmetic subgroups can be obtained 
by generalizing Proposition 6.8.14 to allow L to be a totally real quadratic 
extension of a totally real algebraic number field F’ (as in Proposition 6.7.1). 
However, in this situation, one must require b;,..., 6, to be chosen in such a 
way that SU (diag(b1, b2,..-,bm),T;Op)° is compact, for every place o of F, 
such that o 4 Id. For n > 3, every arithmetic subgroup of SL(n, R) is obtained 
either from this unitary construction or from Proposition 6.8.9 (see Theo- 
rem 18.4.1). 


Exercises for §6.8. 


#1. 


#2. 


#3. 


4A, 


#5. 


#6. 


47. 


Show Nyrea(zy) = Nrea(a) Nrea(y) for all elements x and y of a quater- 


a, 


nion algebra H;; . 


(Schur’s Lemma) Suppose A is a (finite-dimensional) F-algebra, and 
M is a simple A-module (that is finite-dimensional as a vector space 
over F’). Show End,4(M) is a division algebra, where 


End4(M) = {y: M > M | y(am) =ay(m), Vae A, me M}. 


[Hint: If y is not invertible, then it has a nontrivial kernel, which is a non- 
trivial A-submodule of M.] 


Show that if J is any minimal left-ideal of a finite-dimensional algebra A, 
then there exist a),...,@,, € A, such that A = la, ®--- ®Iay. 

[Hint: By finite-dimensionality, A = Ia; +---+Jan for some ai,...,@n € A. 
If n is minimal, then Jan Z ai +--+: + Ian—1, so the minimality of J implies 
Tan M (a1 +--+ + Ian-1) = {0}.] 


Show that if A is a ring with identity, and we consider A to be a left 
A-module, then, for every y € End,(A), there exists a € A, such that 
p(x) = xa for all x € A. 

[Hint: Let a = ¢(1).] 


For any minimal left ideal J of a ring A, and any a € A, such that 
Ia # {0}, show I & Ia as left A-modules. 


[Hint: i ++ ia is a homomorphism of modules that is obviously surjective. 
The minimality of I implies it is also injective.] 


Show that if D is a division algebra over an algebraically closed field F, 
then D = F. 


[Hint: Multiplication on the left by any x € D is a linear transformation, 
which must have an eigenvalue A € F’. Then x — X is not invertible.| 


Suppose H ag is a quaternion algebra over some field F’, and let L = 
F + Fi C HF”. 
a) Show that if a is not a square in F’, then L is a subfield of He, 
b) Show that H oe” is a two-dimensional (left) vector space over L. 
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c) For each x € H®”, define R,: Ht? > HY? by Rz(v) = vax, and 
show that R, is an L-linear transformation. 
d) For each x € E “2 show det(Rz) = Nrea(2). 


#8. Let 7 be an anti-involution on a division algebra D. 
a) For any J € Matnxn(D), define By: D” x D” > D by 
By(x,y) =7(a")Jy 
for all x,y € D” = Matn.1(D). Show that By; is a Hermitian form 
if and only if r(J7) = J. 
b) Conversely, show that if B is a Hermitian form on D”, then B = 
By, for some J € Matnxn(D). 


#9. Let D be a finite-dimensional algebra over a field F’. Show that D is a 
division algebra if and only if D has no proper, nonzero left ideals. (We 
remark that, by definition, D is stmple if and only if it has no proper, 
nonzero two-sided ideals.) 


Notes 


Generalizing the examples considered here, see Chapter 18 for the con- 
struction of all arithmetic subgroups of classical groups (except some strange 
arithmetic subgroups of groups, such as SO(1,7), whose complexification has 
SO(8, C) as a simple factor). 

The construction of all arithmetic subgroups of SL(2, R) is discussed (from 
the point of view of quaternion algebras) in [7, Chap. 5]. 

See [14, Cor. 2 of §4.3.2, p. 43] for a proof of the fact (used in Proposi- 
tion 6.4.1 and Exercise 6.7#1) that if a1,...,@n41 € Q are not all of the same 
sign, and n > 4, then the equation a,x? + 6 On410744 = 0 has a nontriv- 
ial integer solution. It is called Meyer’s Theorem, and will be used again in 
Corollary 18.6.2. 

The original paper of Gromov and Piatetski-Shapiro [5] on the construc- 
tion of nonarithmetic lattices in SO(1,n) (86.5) is highly recommended. The 
exposition there is very understandable, especially for a reader with some 
knowledge of arithmetic groups and hyperbolic manifolds. A brief treatment 
also appears in [9, App. C.2, pp. 362-364]. 

It was known quite classically that there are nonarithmetic lattices in 
SO(1, 2) (or, in other words, in SL(2,R)). This was extended to SO(1, 7), for 
n <5, by Makarov [8] and Vinberg [15]. The nonarithmetic lattices of Gromov 
and Piatetski-Shapiro [5] came later. Nonarithmetic lattices in SU(1,n) were 
constructed by Mostow [10] for n = 2, and by Deligne and Mostow [3] for 
n = 3. These results on SO(1,n) and SU(1,7) are presented briefly in [9, 
App. C, pp. 353-368]. 

The Kronecker-Weber Theorem can be found in books on Class Field 
Theory, such as [11, Thm. 5.1.10, p. 324] (or see [4]). 
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Wedderburn’s Theorem (6.8.5) is proved in [12, Thm. 3.5, p. 49], and 
other introductory texts on noncommutative rings (often in the more general 
setting of semisimple Artinian rings). 

The fact that division algebras of degree 3 are cyclic (mentioned on 
page 147) is due to Wedderburn [16], and a proof can be found in [{6, 
Thm. 2.9.17, p. 69]. Much more generally, the famous (and much more diffi- 
cult) Albert-Brauer-Hasse-Noether Theorem states that any division algebra 
(of any degree) over a finite extension of Q is cyclic. It was first proved in 
[1, 2]. See [13, proof of Thm. 32.20, p. 280] for references to more modern 
expositions of Class Field Theory that provide proofs. 
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Chapter 7 


SL(n, Z) is a lattice 
in SL(n, R) 


In this chapter, we describe two different proofs of the following crucial fact, 
which is the basic case of the fundamental fact that if G is defined over Q, then 
Gz is a lattice in G (see Theorem 5.1.11). This special case was specifically 
mentioned (without proof) in Example 5.1.12(2). 


(7.0.1) Theorem. SL(n, Z) is a lattice in SL(n, R). 


The case n = 2 of Theorem 7.0.1 was established in Example 1.3.7, by 
constructing a subset F of SL(2,R), such that 


1) SL(2,Z) - F =SL(2,R), and 
2) F has finite measure. 


Our first proof of Theorem 7.0.1 shows how to generalize this approach to 
other values of n, by choosing F to be an appropriate “Siegel set” (see Sec- 
tions 7.2 and 7.3). 


(7.0.2) Remarks. 


1) As was mentioned on page 88, the statement that Gz is a lattice in G is 
more important than the proof. The same is true of the special case in 
Theorem 7.0.1, but it is advisable to understand at least the statements 
of the three main ingredients of our first proof: 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. _ http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: definition of Z-lattices (Definition 5.4.1) 
and Moore Ergodicity Theorem (Section 4.10). 
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(a) the definition of a Siegel set (see Section 7.2), 

(b) the fact that every Siegel set has finite measure (see Proposi- 
tion 7.2.5), and 

(c) the fact that some Siegel set is a coarse fundamental domain for 
SL(n, Z) in SL(n, R) (cf. Theorem 7.3.1). 


2) This subject is often called Reduction Theory. The idea is that, given 
an element g of G, we would like to multiply g by an element y of I 
to make the matrix yg as simple as possible. That is, we would like to 
“reduce” g to a simpler form by multiplying it by an element of I. This 
is a generalization of the classical reduction theory of quadratic forms, 
which goes back to Gauss and others. 


Unfortunately, serious complications arise when using Siegel sets to es- 
tablish in general that Gz is a lattice in G Theorem 5.1.11 (see the proof in 
Section 19.4). Therefore, we will give a second proof with the virtue that it 
can easily be extended to establish that all arithmetic subgroups are lattices 
(see Section 7.4 for this proof of Theorem 7.0.1, and see Exercise 7.44#¢20 for 
the generalization to a proof of Theorem 5.1.11). However, this argument 
relies on a fact about SL(n,R)/SL(n,Z) that we will not prove in general 
(see Theorem 7.4.7). 


(7.0.3) Warning. The Standing Assumptions (4.0.0 on page 43) are 
not in effect in this chapter, because we are proving that [T = SL(n, Z) is 
a lattice, instead of assuming that it is a lattice. 


§7.1. Iwasawa decomposition: SL(n,R) = KAN 


The definition of a “Siegel set” is based on the following fundamental structure 
theorem: 


(7.1.1) Theorem (Iwasawa Decomposition of SL(n,R)). In G = SL(n,R), 
let 
a 
1 Kk } a2 0 . 
k=SO(n), N= » wAS 2, : 
0 ; 1 0 an 

Then G = KAN. In fact, every g € G has a unique representation of the 
form g=kau withke K,ae A, anduc N. 


Proof. It is important to note that, because of the superscript “o” in its defi- 
nition, A is only the identity component of the group of diagonal matrices; the 
entire group of diagonal matrices has a nontrivial intersection with K. With 
this in mind, the uniqueness of the decomposition is easy (see Exercise 1). 

We now prove the existence of k, a, and u. To get started, let €1,...,én 
be the standard basis of R”. Then, for any g € G, the set {ge1,...,gen}isa 
basis of R”. 
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The Gram-Schmidt Orthogonalization process constructs a corresponding 


orthonormal basis wj,..., Wn. We briefly recall how this is done: for 1 <i < 
n, inductively define 
i-l 1 
we =U; - Sow |w;)w; and w= eye where v; = gé;. 
j=l . 
It is easy to verify that w1,...,W, is an orthonormal basis of R” (see Exer- 
cise 2). 
Since {w1,...,Wn} and {€1,...,€,} are orthonormal, there is an orthog- 


onal matrix k € O(n), such that kw; = ¢; for all i. Then 
kw; = k- |lwe || wi = |lw;||(k wi) = |]; ll es, 
so there is a diagonal matrix a (with positive entries on the diagonal), such 
that 
kw; = ae; for all i. 
Also, it is easy to see (by induction) that w; € (v1,...,v;) for every 7. With 
this in mind, we have 


eg aS, -1 
Gg Wfe= ae “(v5 | tj) w Wj 


—1 


eg “Up eg” i 


=e, + (€1,...,€i-1), 
so there exists u € N, such that 
gw? = ue; for all 7. 
Therefore 
u_'g-twt =e, =a 'kw? 


so u-tg-! =a~'k. Hence, g = k~'au~! € KAN (see Exercise 4). 


Exercises for §7.1. 


#1. Show that if kyayu, = k2aqu2, with k; € K, a; € A, and u; € N, then 
ky = ka, ai = a2, and U1 = U2. 
[Hint: Show kp tke = a1uiug'az* € KN AN = {e}, so ki = kg. This implies 
aj a2 = uuz' € ANN = {e}, so a1 = a2 and wi = wu] 

#2. In the notation of the proof of Theorem 7.1.1, show {w1,...,wn} is an 
orthonormal basis of R”. 


(Hint: Calculating an inner product shows that wj wx whenever i > k.] 


#3. Show that the components k, a, and u in the Iwasawa decomposition 
g = kau are real analytic functions of g. 


[Hint: The matrix entries of a and k~' can be written explicitly in terms 
of the vectors w; and w;, which are real-analytic functions of g. Then u = 
a_'k~'g is also real analytic.] 


#4. In the proof of Theorem 7.1.1, note that: 
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e ais a diagonal matrix, but we do not know the determinant of a, 
so it is not obvious that a € SL(n, R), and 
e k € O(n), but K = SO(n), so it is not obvious that k € Kk. 
From the fact that g = k-'au~', showa€ Aandke K. 


[Hint: We know detk € {+1}, deta > 0, detu = 1, and the determinant of 
a product is the product of determinants.| 


#5. Let G = SL(n,R). 

a) Show G=KNA=ANK=NAK. 
[Hint: We have AN= NA andG=G"1'] 

b) (optional) (harder) Show G4 NKA (if n > 2). 
[Hint: For n = 2, the action of G by isometries on §)” yields a simply 
transitive action on the set of unit tangent vectors. Let v be a vertical 
tangent vector at the point 2, and let w be a horizontal tangent vector 
at the point 27. The N-orbit of w consists of horizontal vectors at points 


on the line R + 27, but vectors in the K A-orbit of v are horizontal only 
on the line R + 7.] 


#6. Show that every compact subgroup of SL(n,R) is conjugate to a sub- 
group of SO(n). 
[Hint: For every compact subgroup C of SL(n, R), there is a C-invariant inner 
product on R”, defined by (v | w) = f,,(cu- cw) de. Since (v | w) = gu- gw for 
some g € SL(n,R), the usual dot product is invariant under some conjugate 
of C. This conjugate is contained in SO(n).| 


§7.2. Siegel sets for SL(n, Z) 


(7.2.1) Example. Let [ = SL(2,Z) and G = SL(2,R). Figure 7.2A(a) (on 
page 157) depicts a well-known fundamental domain for the action of [ on 
the upper half plane §}. (We have already seen this in Figure 1.3A.) For con- 
venience, let us give it a name, say Fy. There is a corresponding fundamental 
domain Fo for [ in G, namely 


Fo ={g9€G| g(t) € Fo} 


(cf. Exercise 1). 


Unfortunately, the shape of Fo is not entirely trivial, because the bottom 
edge is curved. Furthermore, the shape of a fundamental domain gets much 
more complicated when G is larger than just SL(2,R). Therefore, we will 
content ourselves with finding a set that is easier to describe, and is close to 
being a fundamental domain. 


(7.2.2) Example. To construct a region that is simpler than Fo, we can 
replace the curved edge with an edge that is straight. Also, because we do 
not need to find precisely a fundamental domain, we can be a bit sloppy 
about exactly where to place the edges, so we can enlarge the region slightly 
by moving the edges out a bit. The result is depicted in Figure 7.2A(b). 
This new region F is slightly larger than a fundamental domain, but it is 
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1 1 1 
—1 5 5 1 -l a5 0 a 1 
Figure 7.2A(a). A fundamen- Figure 7.2A(b). A coarse fun- 
tal domain Fo. damental domain F. 


within bounded distance of a fundamental domain, and it suffices for many 
purposes. In particular, it is a coarse fundamental domain, in the sense of 
Definition 4.7.2 (see Exercise 6). 

An important virtue of this particular coarse fundamental domain is that 
it can be specified quite easily: 


Fa{otyi 


for appropriate C1, C2,c3 € R. 


cS 2 <9, 
CAG 


By using the Iwasawa decomposition G = NA K (see Exercise 7.14£5), we 
can give a fairly simple description of the corresponding coarse fundamental 
domain F in SL(2,R): 

(7.2.3) Example. Let 
© F={9eG| gli) €F}, 


oe 
ale 


e kK =SO(2). 
Then (see Exercise 10) 


F = Nex cp Ac K. 
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Any set of the form Ne, .c,Ac,K is called a “Siegel set? so we can summa- 
rize this discussion by saying that Siegel sets provide good examples of coarse 
fundamental domains for SL(2, Z) in SL(2,R). 

To construct a coarse fundamental domain for SL(n, Z) (with n > 2), we 
generalize the notion of Siegel set to SL(n, R). 


(7.2.4) Definition (Siegel sets for SL(n, Z)). Let G = SL(n, R), and consider 
the Iwasawa decomposition G = NAK (see Exercise 7.145). To generalize 
Example 7.2.3, we construct a “Siegel set” S by choosing appropriate subsets 
N of N and A of A, and letting S= NAK. 


e The set N can be any (nonempty) compact subset of N. For example, 
we could let 


N=Noec. ={UEN | c1 < uij < ce fori <j}. 


e Note that the set A,, of Example 7.2.3 has the following alternate 
description: 


Acs ={aceA|aii > 3 a2,2 }. 


Therefore, we can generalize to SL(n,R) by defining 


Ag= 166 Al| agg > Canggin tort 1 sng Tl fs 


Thus, for cj,c2 € R and c3 € R™, we have a Siegel set 
S) = Na asAbe les 


€15€2,C3 


By calculating an appropriate multiple integral, it is not difficult to see 
that Siegel sets have finite measure: 


(7.2.5) Proposition (see Exercise 14). 8¢, ¢5,c, has finite measure (with re- 
spect to the Haar measure on SL(n,R)). 


Exercises for §7.2. 


#1. Suppose H is a closed subgroup of G, and F is a strict fundamental 
domain for the action of [ on G/H. For every x € G/H, show that 


F={g9E€G|greEF} 


is a strict fundamental domain for [ in G. 


#2. Suppose F; and F»2 are coarse fundamental domains for [ in G. Show 
that if F; C F C Fo, then F is also a coarse fundamental domain for 
in G. 


#3. Suppose 
e F is a coarse fundamental domain for the action of T on X, and 
e F is a nonempty, finite subset of I. 
Show that FF = User fF is also a coarse fundamental domain. 


4A, 


#5. 


#6. 


#7. 


#8. 


9, 


4£10. 


#11. 


#12. 
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Suppose H is a closed subgroup of G, and F is a coarse fundamental 
domain for the action of T on G/H. For every x € G/H, show that 


F={ge€G|gr EF} 
is a coarse fundamental domain for T in G. 


In the notation of Figure 7.2A, show that the coarse fundamental do- 
main F is contained in the union of finitely many I-translates of the 
fundamental domain Fo. 


Show that the set F depicted in Figure 7.2A(b) is indeed a coarse 
fundamental domain for the action of [ on §. 


[Hint: Exercises 3 and 5. You may assume (without proof) that Fo is a 
fundamental domain.] 


Suppose 
e F is a coarse fundamental domain for [ in G, and 
e F is a nonempty, finite subset of , and 
e I’; is a finite-index subgroup of I. 
Show: 
a) that F\F is also a coarse fundamental domain for [ in G. 
b) If [) FF = G, then F\F is a coarse fundamental domain for both 
T and Ty; in G. 


Assume I is infinite (or, equivalently, that G is not compact), and [; 
is a finite-index, proper subgroup of [. Show there exists a (strict) 
fundamental domain for T, in G that is not contained in any coarse 
fundamental domain for [in G. 
[Hint: Construct a strict fundamental domain for [; that contains a strict 
fundamental domain Fo for I’, but is not covered by finitely many I’-translates 
of Fo: 
Suppose 

e F is a coarse fundamental domain for [ in G, and 

eg ENe(L). 
Show that F9 = g~!Fgq is also a coarse fundamental domain. 
Let F be the coarse fundamental domain for SL(2, Z) in SL(2, R) that 
is defined in Example 7.2.3. Verify that F = Ne, c.Ac,K. 


Let G = SL(n,R). Given c > 0, show there exists a € A, such that 
Ae = aAt. 


This exercise provides a description of the Haar measure on G. 

Let dg, dk, da, and du be the Haar measures on the unimodular 
groups G, K, A, and N, respectively, where G = KAN is an Iwasawa 
decomposition. Also, for a € A, let p(a) be the modulus (or Jacobian) 
of the action of a on N by conjugation, so 


| f(a~'ua) du =| f(u) p(a) du for f € C.(N). 
N N 
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Show, for f € C.(G), that 


Fast imi da du dk 
= fff seven) 0 a) du da dk. 


[Hint: Since G is unimodular, dg is invariant under left translation by ele- 
ments of K and right translation by elements of AN.| 


#13. Let G = SL(n,R), choose N, A, and K as in Definition 7.2.4, and define 
p as in Exercise 12. Show 


Q1,1 
a2 ( Ty i 


QAn,n 
#14. Let c1,c2,c3 € R, with cy < cg and c3 > 0. Show that the Siegel set 
S¢,,c9,c, in SL(n, R) has finite measure. 


[Hint: See Exercises 12 and 13 for a description of the Haar measure on 
SL(n, R).] 


§7.3. Constructive proof using Siegel sets 
In this section, we prove the following result: 
(7.3.1) Theorem. Let 
e G=SL(n,R), 
e f=SL(n,Z), and 
e 80,1,4 = No,iA1/2K be the Siegel set defined in Definition 7.2.4. 


This establishes Theorem 7.0.1: 


Proof of Theorem 7.0.1. Combine the conclusion of Theorem 7.3.1 with 
Propositions 4.1.11 and 7.2.5. L] 


(7.3.2) Remarks. 


1) Tis written on the left in the conclusion of Theorem 7.3.1, because our 
definition of Siegel sets is motivated by a fundamental domain for the 
action of SL(2,Z) on 7, and [ acts on the left there. However, taking 
the transpose of both sides of the conclusion of Theorem 7.3.1 yields 
C= 8h 7 if, where 8h 01,4 = = KAj/2NZ. c,: Lhus, I’ can be written on 
the right, ‘if the definition of Siegel set is modified appropriately. 


2) Our definition of Siegel sets uses the upper-triangular group N, and 
Theorem 7.3.1 puts [’ on the left. Then (1) uses the lower-triangular 
group N? (also called N~), and puts I on the right. Some authors 
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reverse this, using N~ when I is on the left and using N when the 
action on the right. However, to accomplish this, the inequality in the 
definition of A, needs to be reversed. (See Exercise 1 and the proof of 
Theorem 7.3.1.) 


The following elementary observation is the crux of the proof of Theo- 
rem 7.3.1: 


(7.3.3) Lemma. If £ is any Z-lattice in R", then there is an ordered basis 
U1,--+,Un Of R”, such that 


1) {u1,...,Un} generates L as an abelian group, and 


2) || proj; visill = 5 {| projz_y u;|| for 1 <i < n, where proj; : R” > 
V,+ is the orthogonal projection onto the orthogonal complement of the 
subspace V; spanned by {v1,v2,..-, vi}. 


Proof. Choose v; to be a nonzero vector of minimal length in £. Then define 
the remaining vectors v2, U3,...,Un by induction, as follows: 
Given v1, V2,...,U;, choose vj41 € L to make projy vist as short as 
possible, subject to the constraint that v;+1 is linearly independent 
from {v1,U2,...,v;} (so proj v441 is nonzero). 
We now verify (1) and (2). 
(1) For each i, let £; be the abelian group generated by v1, v2,...,u;. If 
Ln #L, we may let 7 be minimal with £34; A £AVi41. Then we must have 
proj; ie eee proj; (L 1 Vi41) (see Exercise 2), so there is some v € LO Vi41 
with proj; t= k-projy v for some k > 2 (see Exercise 3). This contradicts 
the minimality of || proj? v;411|. 
(2) For simplicity, assume i = 1 (see Exercise 4), and let v3 = projt va, 
SO Vg = vz + avi, with a € R. Obviously, there exists k € Z, such that 
la — k| < 1/2. If || projy vel] < 3||vi], then 
||v2 — kvil] = |lvg + (a — k)vi|| < ||v2|| + la — I - [lus 


U + U U{\\- 
2 : 2 


This contradicts the minimality of ||v;]]. 


Proof of Theorem 7.3.1. We wish to show G =189,,.1 =I Noi Aijo K. 
However, since the proof uses an action of G, and most readers prefer to have 
this action on the left, we will instead prove an analogous result with [ on 


the right: G = 5), 1 T°. Namely, given g € G, 
aw} 


we will show g € K Ajj, Nol, 


where Ay ={a7!|aE€ A} ={aE€ A] aii < Gigi i4i/c for all i}. 
For convenience, let £ = gZ”, and let {€1,...,€,} be the standard basis 
of R”. Lemma 7.3.3 provides us with a sequence v1,...,Un of elements of L. 


From 7.3.3(1), we see that, after multiplying g on the right by an element 
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of T, we may assume 
ge =u; ford Sle ott 
(see Exercise 6). 

From the Iwasawa decomposition G = KAN (see Theorem 7.1.1), we 
may write g = kau with k € K,a € A, and u € N. For simplicity, let us 
assume k; is trivial (see Exercise 7), so 

g=auwithac Aanduc N. 
Since g € AN, we know g is upper triangular (and its diagonal entries are 
exactly the same as the diagonal entries of a), so 
(€1,€2, fst ei) a (9E1, GE2; eed 5 GE%) = (v1, V2, fone , Ui) for all 2. 
This implies that the diagonal entry a;,; of a is given by 
‘ Pall 
Gis = Gia = || Projj_, geil| = || projj_, val 
< 2\| projy vi¢all| = 2|| proj; gesaill = 2oc41 641 = 2aii 41. 

Therefore a € Ay)»: 

Also, there exists y € [TM N, such that u € Noivy (see Exercise 8). 


Therefore g = au € Ay)» Noa y © K Aj), Noi T, as desired. (el 


(7.3.4) Remark. It can be shown that that the Siegel set 8o,1,4 is a coarse 
fundamental domain for SL(n,Z) in SL(n,R) (cf. Subsection 19.4(ii)), but 
this fact is not needed in the proof that SL(n, Z) is a lattice in SL(n, R). 


Exercises for §7.3. 


#1. Let 
bs Nix =4 |} 9 | a<t<eh}, 


°4,={[" +] | Se} 

e kK =SO(2), and 

oF SNe a AG. KG 
Show that F’ is a coarse fundamental domain for SL(2,Z) in SL(2, R) 
if and only if the set F = N¢, -,Ac,K of Example 7.2.3 is a coarse 
fundamental domain. 


[Hint: Conjugate by ° 4] .| 


#2. In the notation of Lemma 7.3.3, show that if X and Y are two subgroups 
of V;41, such that 
XCY, XNVY=YNYV,, and projt X = proj/ Y, 
then X =Y. 


#3. In the notation of Lemma 7.3.3, show that the group projy (LN Vi41) 
is cyclic. 
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[Hint: Since dim proj} Vii1 = 1, it suffices to show projt (LM Vi41) is dis- 
crete. | 

#4. Prove Lemma 7.3.3(2) without assuming i = 1. 
[Hint: Mod out Vi-1, which is in the kernel of both proj;-, and projj-.] 


#5. For g € GL(n,R), show g € GL(n,Z) if and only if gZ” C Z” and 
Go er. 


#6. For every n-element generating set {v1,...,Un,} of the group Z”, show 
there exists y € SL(n, Z), such that ge; = +u; for every 1. 
[Hint: Show there exists y € GL(n, Z), such that ge; = v; for every 7.] 


#7. Complete the proof of Theorem 7.3.1 (without assuming the element k 
is trivial). 
[Hint: The group K acts by isometries on R”, so replacing {v1,...,Un} with 
its image under an element of K does not affect the validity of 7.3.3(2).] 


#8. For all c € R, show N = Ne 41 Nz. 


§7.4. Elegant proof using nondivergence of unipotent orbits 


We now present a very nice proof of Theorem 7.0.1 that relies on two key 
facts: the Moore Ergodicity Theorem (4.10.3), and an important observation 
about orbits of unipotent elements (Theorem 7.4.7). The statement of this 
observation will be more enlightening after some introductory remarks. 


(7.4.1) Example. Let a = i v/9]: or, more generally, let a be any element 


of SL(2,R) that is diagonalizable over R (and is not +Id). Then a has one 
eigenvalue that is greater than 1, and one eigenvalue that is less than 1 (in 
absolute value), so it is obvious that there exist linearly independent vectors 


v4 and v_ in R?, such that 


a®*vs +0 and a-*v_ 30 ask— +00. 


By the Mahler Compactness Criterion (4.4.7), this implies that some of the 
orbits of a on SL(2,R)/SL(2,Z) are “divergent” or “go off to infinity” or 
“leave compact all sets’ That is, there exists x € SL(2,R)/SL(2,Z), such 
that, for every compact subset C’ of SL(2,R)/SL(2, Z), 


{keZ|a*x €C} is finite 


(see Exercise 1). 


In contrast, if u = 4 th then it is clear that there does not exist a 


nonzero vector v € R?, such that u*v > 0 as k > oo. In fact, if v is not fixed 
by u (ie., if uv ~ v), then 
\Ju*v|| 3 co as k > +00 (7.4.2) 


(see Exercise 2). Therefore, it is not very difficult to show that none of the 
orbits of u on SL(2,R)/SL(2, Z) go off to infinity: 
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(7.4.3) Proposition. Jf u is any unipotent element of SL(2,R), then, for 
all x € SL(2,R)/SL(2, Z), there is a compact subset C of SL(2,R)/SL(2,Z), 
such that 


{ke Zt |u*a €C} is infinite. 


Proof. We may assume u is nontrivial. Then, by passing to a conjugate (and 
perhaps taking the inverse), we may assume u = [4 {]. 

Choose a small neighborhood © of 0 in R? so that, for all g € SL(2,R), 
there do not exist two linearly independent vectors in ONgZ? (see Exercise 4). 


Since xZ? is discrete, we may assume © is small enough that 
On«Z? = {0}. (7.4.4) 


Since O is open and 0 is a fixed point of u (and the action of u~+ is continuous), 
there exists r > 0, such that 


B,(0) Uu~'B,(0) € O, (7.4.5) 
where B,(0) is the open ball of radius r around 0. Let 
C = {c€ SL(2,R)/SL(2, Z) | cZ?M B,(0) = {0} }. 
The Mahler Compactness Criterion (4.4.7) tells us that C' is compact. 


Given N € Zt, it suffices to show there exists k > 0, such that uN t*x € 
C. That is, 


we wish to show there exists k > 0, such that uN t*2Z? n B,(0) = {0}. 


Let v be a nonzero vector of smallest length in uN 7Z?. We may assume 
\|v|| <r (for otherwise we may let k = 0). Hence, (7.4.4) implies that v is 
not fixed by u. Then, from (7.4.2), we know there is some k > 0, such that 
||u*vl|| > r, and we may assume k is minimal with this property. Therefore 
||u*-20|| < 7, so u®—1y € B,(0) C O by (7.4.5). 

From the choice of O, we know that ON uNt+*-!xZ does not contain 
any vector that is linearly independent from u*~!v. Therefore uN +*xZ? does 
not contain any nonzero vectors of length less than r (see Exercise 5), as 
desired. O 


This result has a natural generalization to SL(n, R) (but the proof is more 
difficult; see Section 7.5): 


(7.4.6) Theorem (Margulis). Suppose 
e u is a unipotent element of SL(n,R), and 
e x © SL(n,R)/SL(n, Z). 
Then there exists a compact subset C of SL(n,R)/SL(n, Z), such that 
{ke Z* |u*a €C} is infinite. 


In other words, every unipotent orbit visits some compact set infinitely 
many times. In fact, it can be shown that the orbit visits the compact set 
quite often — it spends a nonzero fraction of its life in the set: 
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(7.4.7) Theorem (Dani-Margulis). Suppose 


e u is a unipotent element of SL(n,R), and 
e x € SL(n,R)/SL(n, Z). 
Then there exists a compact subset C' of SL(n,R)/SL(n,Z), such that 
ein ep eee kr eC 
limint #4 Se: tt}, ee oo 


m—oo m 


Before saying anything about the proof of this important fact, let us see 
how it implies the main result of this chapter: 
Proof of Theorem 7.0.1. Let 

e X =SL(n, R)/SL(n, Z), and 

e 4 be an SL(n, R)-invariant measure on X (see Proposition 4.1.3). 


We wish to show p(X) < oo. 
Fix a nontrivial unipotent element u of SL(n,R). For each x € X and 
compact C' C X, let 


ke 4{1,2,.22;m) | u’re C 
pc(a) a { } | i 
m— oo m 
Since X can be covered by countably many compact sets, Theorem 7.4.7 
implies there is a compact set C’, such that 


pc > 0 on a set of positive measure (7.4.8) 


(see Exercise 6). Letting Xc be the characteristic function of C’, we have 


EIT e.: eC 
[ vcan= | lim int #4 i, wa = Ne 
x Be 


m— oo m 
Fatou’s 
peg oe see neo 
< Kimint f ua { ms | — Layla) Lemma 
en ae a (B6.4) 


1 
=liminf — | (Xy-1¢ + Xu-20 $207 + Xu-me) du 
x 


moo m 


1 
= liminf — (| Xuarcdu+ f Xuron tet f Xy-medi) 


_ 4: . 1 —1 —2 —m 
= liminf — (n(u C)+p(u “C)+---+ u(u c)) 


m—- 


= ieague = (u(C) + w(C) +--+ 4(C)) 


m—- oo 

= u(C) 

< OO, 

so pc € LX, p). 
It is easy to see that po is u-invariant (see Exercise 7), so the Moore 
Ergodicity Theorem (4.10.3) implies that po is constant (a.e.). Also, from 
(7.4.8), we know that the constant is not 0. Therefore, we have a nonzero 
constant function that is in £1(X,), which tells us that p(X) is finite. 
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Now, to begin our discussion of the proof of Theorem 7.4.7, we introduce 
a bit of terminology and notation, and make some simple observations. First 
of all, let us restate the result by using the Mahler Compactness Criterion 
(4.4.7), and also replace the discrete times {1,2,3,...,m} with a continuous 
interval [0,7]. Exercise 10 shows that this new version implies the original. 


(7.4.9) Definition. For any Z-lattice £ in R”, there is some g € GL(n,R), 
such that L = gZ”. We say L is unimodular if det g = +1. 


(7.4.10) Theorem (restatement of Theorem 7.4.7). Suppose 


e {u'} is a one-parameter unipotent subgroup of SL(n,R), 


e £ is a unimodular Z-lattice in R", and 


e | is the usual Lebesgue measure (t.e., length) on R. 
Then there exists a neighborhood O of 0 in R”, such that 


i({e E[0,T] | wl£nO = {0} +) 
lim inf 
TS 0 T 


pals 


(7.4.11) Notation. Suppose W is a discrete subgroup of R”. 
e A vector w € W is primitive in W if Aw ¢ W, for0<A< 1. 
° Let W be the set of primitive vectors in W. 
e Let Wt C W be a set of representatives that contains either w or —w, 


but not both, for every w € W. (Note that W = _W: see Exercise 11.) 


For simplicity, let us assume now that n = 2 (see Section 7.5 for a discus- 
sion of the general case). 
(7.4.12) Lemma. 
1) There is a neighborhood O, of 0 in R?, such that if W is any unimod- 
ular Z-lattice in R?, then #(Wt n 01) a 


2) Given any neighborhood O, of 0 in R?, and any € > 0, there exists 
a neighborhood Og of 0 in R?, such that if x € R*, and [a,b] is an 
interval in R, such that there exists t € [a,b] with u'x ¢ O,, then 


L({t € [a, }] | u'x € Oo }) < el ({tE [aD] | u'x € O;}). 


Proof. (1) A unimodular Z-lattice in R? cannot contain two linearly inde- 
pendent vectors of norm less than 1 (see Exercise 12). 

(2) Note that u’a moves at constant velocity along a straight line (see Ex- 
ercise 13). So we simply wish to choose Oz small enough that every line seg- 
ment that reaches the boundary of OQ; has only a small fraction of its length 
inside Op» (cf. Figure 7.4A). 

By making ©; smaller, there is no harm in assuming it is a disk centered 
at 0. Let R be the radius of O,, and let Og be a disk of radius r centered 
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Figure 7.4A. Part (2) of Lemma 7.4.12: Any line segment 
that reaches the boundary of the large disk has only a small 
fraction of its length inside the tiny disk. 


at 0, with r small enough that 
2r 
R-r 
Then, for any line segment L that reaches both O2 and the boundary of O,, 
we have: 


<e€. 


e the length of DM Q2 is < the diameter 2r of Oz, and 
e the length of LM QO, is > the distance R — r from OO, to 0O3. 


Therefore, the segment of LZ that is in O2 has length less than ¢€ times the 
length of the segment that is O, (cf. Figure 7.4A). 


Proof of Theorem 7.4.10 when n = 2. Let 0; and QO» be as described 
in Lemma 7.4.12, with « = 1/2. We may assume Q, and Oz are convex, that 
they are small enough that they contain no nonzero elements of £, and that 
Oz C Oj. - 

Fix T € Rt. For each x € LT, and k = 1,2, let 


Ste 0,7)|-2u € Op}. 


Since O; is convex, and u‘z traces out a line (see Exercise 13), we know that 
I* is an interval (possibly empty). Note that: 


1) from Lemma 7.4.12(2) (and the fact that « = 1/2), we see that I(I?) < 
sl(I}), and 


2) from Lemma 7.4.12(1), we see that J}, is disjoint from I}, whenever 
Ae eo 
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Therefore 
i({t€ [0,7] |uwL£NO, 4 {0})= Sou) < (Zs) 
9 2 - x a ~ 9 
welt welt 
1 i 1 fk 
= _— < _ =—_, 
5! U Ir | <54(0,T]) = 5 
xELt 
So, passing to the complement, we have 
as 
U({t € [0,T] | wLM O2 = {0}) 2 5. oO 


Unfortunately, Theorem 7.4.10 is not nearly as easy to prove when n > 2, 
because two basic complications arise. 


1) 


The first difficulty is that the u-orbit of a vector is usually not a straight 
line (contrary to Exercise 13 for n = 2). However, the coordinates of 
u‘x are always polynomials of bounded degree (see Exercise 14), so, for 
any fixed vector 2, 


the function |/u‘||? is a polynomial in t 


and the degree of this polynomial is bounded (independent of x). There- 
fore, it is easy to prove that the appropriate analogue of Lemma 7.4.12(2) 
holds even if n > 2 (see Exercise 15), so the nonlinearity is not a major 
problem. 


A much more serious difficulty is the failure of 7.4.12(1): ifn > 2, then 
a unimodular lattice in R” may have two linearly independent primitive 
vectors that are very small (see Exercise 16). This means that the sets 
I? in the above proof may not be disjoint, which is a major problem. 
It is solved by looking at not only single vectors, but at larger sets of 
linearly independent vectors. More precisely, we look at the subgroups 
generated by sets of small vectors in u’£. These subgroups can intersect 
in rather complicated ways, and sorting this out requires a study of 
chains of these subgroups (ordered by inclusion) and a rather delicate 
proof by induction. Although the proof is completely elementary, using 
only some observations about polynomial functions, it is very clever 
and intricate. The main idea is presented in Section 7.5. 


Exercises for §7.4. 


#1. 


Suppose a € SL(2,R), and there exist linearly independent vectors v+ 
and v_— in R?, such that 

av, +0 asn—-++oo and a”v_ > 0 as n —- —o0. 
Show 4 x € SL(2,R)/SL(2, Z), such that {n € Z| ax € C } is finite, 
for every compact subset C of SL(2,R)/SL(2, Z). 


#2. 


#3. 


4A, 


#65. 


#6. 


#7. 


#8. 


#9. 
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[Hint: There exists g € SL(2,R) that takes the two standard basis vectors 
of R? to vectors that are scalar multiples of v; and v_.] 


Let u = [$7]. For every v € R?, show that either 
e uv = v for all n € Z, or 
e |ju"v|| 4 co as n > 0. 


Generalize the preceding exercise to SL(n, R): 
Let u be any unipotent element of SL(n,R). For every v € R”, 
show that either 
e uv =v for all n € Z, or 
e |ju"v|| 4 co as n > oo. 


[Hint: Each coordinate of u”v is a polynomial function of n, and non-constant 
polynomials cannot be bounded.] 


Suppose v; and v2 are linearly independent vectors in Z?, and we have 
g € SL(2, R). Show that if ||gv1|| < 1, then ||gve|| > 1. 
[Hint: Since g € SL(2,R), the area of the parallelogram spanned by the 


vectors gv1 and gv2 is the same as the area of the parallelogram spanned by 
v1 and v2, which is an integer.| 


Near the end of the proof of Proposition 7.4.3, verify the assertion that 
u"tN ¢Z? does not contain any nonzero vectors of length less than r. 


[Hint: If ||w|| <r, then u”~'v and u~‘*w are linearly independent vectors in 
On ut N-147,] 


Prove (7.4.8). 


[Hint: X cannot be the union of countably many sets of measure 0.| 


In the proof of Theorem 7.0.1, verify (directly from the definition) that 
Pc is u-invariant. 


Show Definition 7.4.9 is well-defined. More precisely, given any gj, g2 € 
GL(n,R), such that g1Z” = g2Z”, show 


det gr. € {41} <= det ge € {+1}. 


Assume 
e wu’ is a one-parameter unipotent subgroup of G, 
e x €G/T, and 
e C* is a compact subset of G/T. 

Show that if 


i({t€ [0,7]| ute ecr}) 
lim inf 


T—0o T 
then there is a compact subset C' of G/T, such that 


aoe eo eee Ke eC 
inane { ed ae args 


m—oo m 


[Hint: Let C = Usepo,1) u'C* | 


0, 


170 


4£10. 


#11. 


#12. 


413. 


4614. 


4£15. 


4£16. 


#17. 
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Show Theorem 7.4.10 implies Theorem 7.4.7. 
[Hint: Mahler Compactness Criterion (4.4.7) and Exercise 9.] 


Suppose w is a nonzero element of a discrete subgroup W of R”. Show 
the following are equivalent: 

a) w is primitive in W. 

b) RwOW = {Zu}. 

c) If kw’ = w, for some k € Z and w’ € W, then k € {+1}. 

d) —w is primitive in W. 


Suppose v and w are linearly independent vectors in a unimodular Z- 
lattice in R?. Show ||v|| - ||w|] > 1. 


Show that if 2 € R?, and {u*} is any nontrivial one-parameter unipo- 
tent subgroup of SL(2,R), then u’x moves at constant velocity along a 
straight line. 


[Hint: Calculate the coordinates of u‘x after choosing a basis so that u’ = 
[o 1}. 

Given n € Z™, show there is a constant D, such that if 2 € R”, and 
{u'} is any one-parameter unipotent subgroup of SL(n,R), then the 
coordinates of u’x are polynomial functions of t, and the degrees of 
these polynomials are < D. 


[Hint: We have u* = exp(tv) for some v € Matnyxn(R). Furthermore, v is 
nilpotent, because u’ is unipotent, so the power series exp(tv) is just a poly- 
nomial. ] 


Given R, D,¢ > 0, show there exists r > 0, such that if 

e f(x) is a (real) polynomial of degree < D, and 

e [a,b] is an interval in R, with |f(t)| > R for some t € [a, d], 
then 


i({t € [2,8] | |[f@)|<r}) <el({t € [a,b] | FO < R}). 


[Hint: If not, then taking a limit yields a polynomial of degree D that vanishes 
on a set of positive measure, but is > R at some point.] 


For every € > 0, find a unimodular Z-lattice £ in R” with n—1 linearly 
independent primitive vectors of norm < e. 


Assume G is defined over Q (and connected). Show there exist 
e a finite-dimensional real vector space V, 
e a vector v in V, and 
e a homomorphism p: SL(é,R) > SL(V), 
such that 
a) G = Stabszery(v)°, and 
b) p(SL(Z, Z))v is discrete. 
[Hint: See the hint to Exercise A4#8, and choose v to be the exterior product 
of polynomials with integer coefficients. | 
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#18. Show that if G is defined over Q, then the natural embedding G/Gz © 
SL(é,R)/SL(Z, Z) is a proper map. 
[Hint: Use Exercise 17.] 


#19. Prove Theorem 5.1.11 under the additional assumption that G is simple. 


[Hint: The natural embedding G/Gz @ SL(é,R)/SL(é, Z) is a proper map 
(see Exercise 18), so the G-invariant measure on G'/Gz provides a G-invariant 
measure 4 on SL(é,R)/SL(¢, Z), such that all compact sets have finite mea- 
sure. The proof of Theorem 7.0.1 (with u € G) implies that y is finite.| 


#20. Prove Theorem 5.1.11 (without assuming that G' is simple). 


[Hint: You may assume Exercise 11.2#10 (without proof). This provides a 
version of the Moore Ergodicity Theorem for groups that are not simple.| 


§7.5. Proof that unipotent orbits return to a compact set 


The proof of Theorem 7.4.10 is rather complicated. To provide the gist of 
the argument, while eliminating some of the estimates that obscure the main 
ideas, we prove only Theorem 7.4.6, which is a qualitative version of the 
result. (The quantitative conclusion in Theorem 7.4.10 makes additional use 
of observations similar to Lemma 7.4.12(2) and Exercise 7.4#15.) This section 
is optional, because none of the material is needed elsewhere in the book. 

By the Mahler Compactness Criterion (and an appropriate modification 
of Exercise 7.4449), it suffices to prove the following statement: 


(7.5.1) Theorem (restatement of Theorem 7.4.6). Suppose 


e {u'} is a one-parameter unipotent subgroup of SL(n,R), and 


e £ is a unimodular Z-lattice in R”. 
Then there exists a neighborhood O of 0 in R”, such that 
{teERt | wLnO = {0} } is unbounded. 


(7.5.2) Definition. Suppose 


e W is a discrete subgroup of R”, and 


e kis the dimension of the linear span (W) of W. 


We make the following definitions: 


1) We define an inner product on the exterior power /* R” by declaring 
{€i, \€ig A+++ A &;, } to be an orthonormal basis, where {€1,...,€,} is 
the standard basis of R”. 


2) Since \* W is cyclic (see Exercise 3), it has a generator w, A-:+ A we 
that is unique up to sign, and we define 


d(W) = ||lwi A--- A wall. 
(However, by convention, we let d({0}) = 1.) 
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(7.5.3) Remark. If W is the cyclic group generated by a nonzero vector 
w € R”, then it is obvious that d(W) = ||w||. Therefore, Definition 7.5.2(2) 
presents a notion that generalizes the norm of a vector. 


The following generalization of Exercise 7.4#14 is straightforward to 
prove (see Exercise 5). 


(7.5.4) Lemma. Suppose 


e {u'} is a one-parameter unipotent subgroup of SL(n,R), and 


e W is a discrete subgroup of R”. 


Then d(u'W )? is a polynomial function of t, and the degree of this polynomial 
is bounded by a constant D that depends only on n. 


Lemma 7.5.4 allows us to make good use of the following two basic proper- 
ties of polynomials of bounded degree. (See Exercises 6 and 7 for the proofs.) 
The first follows from the observation that polynomials of bounded degree 
form a finite-dimensional real vector space, so any closed, bounded subset is 
compact. The second uses the fact that nonzero polynomials of degree D 
cannot have more than D zeroes. 


(7.5.5) Lemma. Suppose D € Z*, « > 0, and f is any real polynomial of 
degree < D. Then there exists C > 1, depending only on D and «, such that, 
for all T,7 > 0: 
1) If f(s) > 7 for some s € [0,T], and |f(T)| < 7/C, then there exists 
t € [0,eT], such that | f(T + t)| =7/C. 
2) If |f(s)| < 7 for alls € [0,T], and f(T) = 7, then there exists Ty € 
[T,4?T], such that 
T/C <|f()|<7C for allt € [T%, 2T)]. 


(7.5.6) Notation. Suppose CL is a Z-lattice in R”. 
e A subgroup W of CL is full if it is the intersection of £ with a vector 
subspace of R”. (This is equivalent to requiring L/W to be torsion- 
free.) 


e Let S(L) be the collection of all full, nontrivial subgroups of £, partially 
ordered by inclusion. 

e For W C £, we let (W)c be the (unique) smallest full subgroup of £ 
that contains W. In other words, (W)c = (W) NL. 


The following simple observation uses full subgroups of £L to provide a 
crucial lower bound on the norms of vectors (see Exercise 8): 
d((W, v)c) 
d((W)c) 
We can now prove Theorem 7.5.1. However, to avoid the need for a proof 
by induction, we assume n = 3. 


(7.5.7) Lemma. If W € 8(£L) andv € L\ W, then |lv|| > 
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Proof of Theorem 7.5.1 when n = 3. It is easy to see that 
{W € 8&(L) | d(W) <1} is finite 
(see Exercise 10). Hence, there exists tT > 0, such that 
d(W) > 7, for all W € S8(L). 

Let: 

e D be the constant provided by Lemma 7.5.4, 

e «=4- and 

e C be the constant provided by Lemma 7.5.5. 


Given T > 0, it suffices to find R > 0, such that |ju?+”v|| > 7/C? for all 
nonzero v € L. 
Let 


D={WES(L) | d(u’W) <7/C}. 
We assume D 4 §) (otherwise, we could let R = 0). For each W € D, 
Lemma 7.5.5(1) implies 

there exists tw € (0, eZ], such that d(u’t'”W) = 7/C. 

By choosing tw minimal, we may assume 

d(u't*W) <1/C for all t € [0, tw). 
Since D is finite (see Exercise 10), we may 

fix some W* € D that maximizes ty. 


From Lemma 7.5.5(2), we see that there exists 
fh 
Ty = [tw+,4?tw+] & ws 4 ’ 


such that 
T/C? < d(u? *W*) < 7C? for all t € [Ty, 2T)]. 


Since dim(L) = n = 3, we know dim(W 7) is either 1 or 2. To be concrete, 
let us assume it is 2. (See Exercise 11 for the other case.) Then, for any 
v€ LNW, we have (W*,v)¢ = L, so Lemma 7.5.7 implies ||u?**v|| > 1/7 
for all t € [T,,2T,]. Hence, it is only the vectors in Wt that can be small 
anywhere in this interval. 

Therefore, we may assume there is some nonzero v9 € WT, such that 
||u? +72 u9|| < 7/C?. There is no harm in assuming that Zvp is a full subgroup 
of £. Then, since T; > tw+, the maximality of ty+ implies ||u?+%vg|| > 7/C 
for some s € [0,7;]. Therefore, Lemma 7.5.5(1) provides some t € [T), 27%], 
such that ||ju?+¢v9|| = 7/C?. Now, for any nonzero v € L, 


either (v)c = (vo)c, or (vo, v)c =W*, or (v,Wt)c =L. 


In each case, we see (by using Lemma 7.5.7 in the latter two cases) that 
\|u? ttyl] > + /C? (if r <1). 
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Exercises for §7.5. 


#1, 
462. 


#$3. 


HA, 


$9. 
#6. 


#7. 


#8. 


#9. 


410. 


Show that Theorem 7.5.1 is a corollary of Theorem 7.4.10. 


Use Theorem 7.5.1 (and not Theorem 7.4.7 or Theorem 7.4.10) to show 
that if {u’}, X, and x are as in Theorem 7.4.7, then there exists a 
compact subset K of X, such that 


{t ER | uae Kk} is unbounded. 


In the notation of Definition 7.5.2, show /\’W is cyclic. 


[Hint: If {wi,...,we} generates W, then A*W is generated by wi A w2 A 


Suppose 
e W is (nontrivial) discrete subgroup of R”, and 
e M € SO(n). 

Show d(MW) = d(W). 


Prove Lemma 7.5.4. 


Prove Lemma 7.5.5(1). 


[Hint: Since rescaling does not change the degree of a polynomial, we may 
assume 7’ = 7 = 1. If C' does not exist, then taking a limit results in a 
polynomial of degree < D that is 1 at some point of [0,1], but vanishes on 
all of [1,1 + €].] 


Prove Lemma 7.5.5(2). 


[Hint: Assume, without loss of generality, that T = 7 = 1. The polynomials 
of degree < D that are < 1 on [0,1] form a compact set, so they are uniformly 
bounded by some constant on [1,4?*"]. For T; € {1,4,...,4?}, the intervals 
[T1, 2T2] are pairwise disjoint. If f is not bounded away from 0 on any of these 
intervals, then taking a limit results in a nonzero polynomial of degree < D 
that vanishes at D + 1 distinct points.| 


Prove Lemma 7.5.7. 


[Hint: This is easy if W is generated by scalar multiples of the standard basis 
vectors of R*, and v € R*t"] 


Show that if £ is a discrete subgroup of R", and 1<k <n, then (YL 
is a discrete subset of /\’R”. 


[Hint: By choosing an appropriate basis, you can assume L C Z” | 


Assume 

e Lisa Z-lattice in R”, and 

ed>0. 
Show there are only finitely many full subgroups of £, such that d(W) < 
é. 
[Hint: Exercise 9. (If Wi and W2 are two different k-dimensional subspaces 
of R", then Wi 4 NW.) 
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#11. Complete the proof of Theorem 7.5.1 in the special case where dim(WT) = 
1 (and w= 3). 
[Hint: If there exist v € L\W* and t € [T, 271], such that ||u7 **v|| < 1/C, 
then d(u? t*(Wt,v)c) =7/C for some R € [T1, 2T;].] 


Notes 


See {1, §1] or [7, §4.2] for more information on Siegel sets in SL(n, R), and 
the proof of Theorem 7.0.1 that appears in Sections 7.2 and 7.3. 

A brief discussion of the connection with the reduction theory of positive- 
definite quadratic forms can be found in [1, §2, pp. 20-24]. 

See [7, Prop. 3.12, p. 129] for a proof of Theorem 7.1.1. A generalization 
to other semisimple groups will be stated in Theorem 8.4.9. 

The clever proof in Section 7.4 is by G. A. Margulis [6, Rem. 3.12(ID]. 

Theorem 7.4.6 is due to G. A. Margulis [5]. (Section 7.5 is adapted from 
the nice exposition in [3, Appendix, pp. 162-173], where all details can be 
found.) The result had been announced previously (without proof), and 
J. Tits [8, p. 59] commented that: 


“For a couple of years, Margulis’ proof remained unpublished and 
every attempt by other specialists to supply it failed. When it 
finally appeared ..., the proof came as a great surprise, both for 
being rather short and using no sophisticated technique: it can 
be read without any special knowledge and gives a good idea of 
the extraordinary inventiveness shown by Margulis throughout 
his work” 
The quantitative version stated in Theorem 7.4.7 is due to S. G. Dani [2]. 
See [4] for a recent generalization, and applications to number theory. 
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Part III 


Important Concepts 


Chapter 8 


Real Rank 


§8.1. R-split tori and R-r 


ank 


(8.1.1) Definition. A closed, connected subgroup T of G is a torus if it is 
diagonalizable over C; that is, if there exists g € GL(n,C), such that gTg~ 
consists entirely of diagonal matrices. A torus is R- 


over | 


R; that is, if g may be chosen to be in GL(n,| 


(8.1.2) Examples. 


1) Let A be the identity component of the group of diagonal matrices in 
SL(n,R). Then A is obviously an R-split torus. 


2) 
3) 


(8.1.3) Warning. An! 


torus 


SO(1,1)° is an R-split torus in SL(2,R) (see 
SO(2) is a torus in SL(2,R) that is not R- 
over C (see Exercise 2), but not over R (see 


T” (except in the trivial case n = 0). 


(8.1.4) Remarks. 


1) 


split if it is diagonalizable 
R). 


Exercise 1). 


split. It is diagonalizable 
Exercise 3). 


R-split torus is never homeomorphic to the topologist’s 


If T is an R-split torus, then every element of T is hyperbolic (see Def- 


inition A5.1). In particular, no nonidentity 
unipotent. 


element of T' is elliptic or 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(¢, R). 


You can copy, modify, and distribute this work, even for commercial purposes, all without 


asking 


permission. http: //creativecommons.org/publicdoma 


Main prerequisites for this chapter: none. 
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2) When G is compact, every torus in G is isomorphic to SO(2)”, for 
some n. This is homeomorphic to T”, which is the reason for the 
terminology “torus? 


It is a key fact in the theory of semisimple Lie groups that maximal R-split 
tori are conjugate: 


(8.1.5) Theorem. If A, and A are maximal R-split tori in G, then there 
exists g € G, such that Ay = gAog™!. 


This implies that all maximal R-split tori have the same dimension, which 
is called the “real rank” (or “R-rank”) of G, and is denoted rankp G: 


(8.1.6) Definition. rank G is the dimension of a maximal R-split torus A 
in G. This is independent of both the choice of A and the choice of the 
embedding of G in SL(¢, R). 

(8.1.7) Examples. 

1) ranky(SL(n,R)) = —1. (Let A be the identity component of the 
group of all diagonal matrices in SL(n,R).) 

2) We have ranky(SL(n,C)) = rankg(SL(n,H)) = n—1. This is be 
cause only the real diagonal matrices remain diagonal when SL(n, C) 
or SL(n,H) is embedded in SL(2n,R) or SL(4n, R), respectively. 

3) ranky G = 0 if and only if G is compact (see Exercise 9). 


(8.1.8) Proposition. rankg SO(m,n) = min{m,n}. 


min{m,n} ( 


Proof. Since SO(m, 7) contains a copy of SO(1, 1) see Exercise 7), 
and the identity component of this subgroup is an R-split torus (cf. Exer- 
cise 1), we have 

rankg SO(m,n) > dim(SO(1, LAs). =min{m,n}. 

We now establish the reverse inequality. Let A be a maximal R-split 
torus. We may assume A is nontrivial. (Otherwise rankg SO(m,n) = 0, so 
the desired inequality is obvious.) Therefore, there is some nontrivial a € A. 
Since a is diagonalizable over R, and nontrivial, there is an eigenvector v of a, 
such that av 4 v; hence, av = Av for some A # 1. Now, if we let (- | -)m.n be 
an SO(m,n)-invariant bilinear form on R™”, we have 


| Oman = (av | a) inn = Ov | Agi = AAW | mae 
By choosing a to be near e, we may assume A & 1, so AX # —1. Since, 
by assumption, we know \ 4 1, this implies A? 4 1. So we must have 
(v | V)m.n = 0; that is, v is an isotropic vector. Hence, we have shown that 
if the real rank is > 1, then there is an isotropic vector in R™*”. 

By arguing more carefully, it is not difficult to see that if the real rank 
is at least k, then there is a k-dimensional subspace of R™*” that con- 
sists entirely of isotropic vectors (see Exercise 10). Such a subspace is said 
to be totally tsotropic. The maximum dimension of a totally isotropic 
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subspace is min{m,n} (see Exercise 11), so we conclude that min{m,n} > 


rankg SO(m, n), as desired. 


(8.1.9) Remarks. 


1) 


Other classical groups, not just SO(m, 7), have the property that their 
real rank is the maximal dimension of a totally isotropic subspace. More 
concretely, we have 

rankg SU(m, n) = rankg Sp(m,n) = min{m, n}. 
The Mostow Rigidity Theorem (15.1.1) will tell us that if [ is (iso- 
morphic to) a lattice in both G and G,, then G® is isomorphic to G4, 
modulo compact groups. Modding out a compact subgroup does not 
affect the real rank (cf. Exercise 9), so this implies that the real rank 
of G is uniquely determined by the algebraic structure of I. 


Although it is not usually very useful in practice, we now state an 
explicit relationship between TI and rankg G. Let S, be the set of all 
elements y of I’, such that the centralizer Cr(y) is commensurable to a 
subgroup of the free abelian group Z” of rank r. Then it can be shown 
that 


rank, G = ann { r > 0 


T is covered by finitely 
many translates of S,. } . 

We omit the proof, which is based on the very useful (and nontrivial) 
fact that if J’ is any maximal torus of G, then there exists g € G, such 
that gTg~*/(( NgTg*) is compact. 


Exercises for §8.1. 


#1. 


49. 
#3. 


HA, 
#5. 


Show that the identity component of SO(1,1) is an R-split torus. 
(Hint: Let g = i 4] . Alternatively, note that each element of SO(1,1) isa 


symmetric matrix (hence, diagonalizable via an orthogonal matrix), and use 
the fact that any set of commuting diagonalizable matrices is simultaneously 
diagonalizable.] 


For g = i ah show every element of g SO(2)g~+ is diagonal. 


Show that SO(2) is not diagonalizable over R. 


[Hint: If T is diagonalizable over R, then eigenvalues of the elements of T are 
real.| 


Show that every R-split torus is abelian. 


Suppose 
e T is an R-split torus in G, and 
e A is a maximal R-split torus in G. 
Show that T is conjugate to a subgroup of A. 


[Hint: By considering dimension, it is obvious that T is contained in some 
maximal R-split torus of G.] 
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#16. 
#7. 


#8. 


#9. 


410. 


#11. 


#12. 
#13. 


4614, 
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Show that every maximal R-split torus in G is almost Zariski closed. 
Assume m >n. Then m+n > 2n, so there is a natural embedding of 
SO(1,1)” in SL(m+n,R). Show that SO(m,n) contains a conjugate 
of this copy of SO(1, 1)”. 


[Hint: Permute the basis vectors.] 


Prove, directly from Definition 8.1.1, that if G, is conjugate to G2 in 
GL(é,R), then ranky(G1) = ranky(G2). 


Show ranky G = 0 if and only if G is compact. 
[Hint: Remarks A5.2 and A2.6(2).] 


Show that if rankg SO(m,n) = r, then there is an r-dimensional sub- 
space V of R™t”, such that (uv | w)mn = 0 for all v,w EV. 


[Hint: Because A is diagonalizable over R, there is a basis {v1,...,Um+n} 
of R™*" whose elements are eigenvectors for every element of A. Since 
dim A = r, we may assume, after renumbering, that for all A1,...,A, € Rt, 


there exists a € A, such that av; = A;,v;, for 1 <i < r. This implies 
(v1,...,Ur) is totally isotropic.] 


Show that if V is a subspace of R™*” that is totally isotropic for 
(- | -)m.n; then dim V < min{m, n}. 
[Hint: If v £0 and the last n coordinates of v are 0, then (v | v)m,n > 0.] 


Show ranky(G, x G2) =rankyG, + rankp Go. 

Show rankpy G > 1 if and only if G contains a subgroup that is isogenous 
to SL(2, R). 

[Hint: Remark A2.6.] 


Show that [I contains a subgroup that is isomorphic to Z", where r = 
ranke G. 
[Hint: You may assume the fact stated in the last sentence of Remark 8.1.9(3).] 


§8.2. Groups of higher real rank 


In some situations, there is a certain subset S of G, such that the centralizer 
of each element of S is well-behaved, and it would be helpful to know that 
these centralizers generate G. The results in this section illustrate that an 
assumption on the real rank of G may be exactly what is needed. (However, 
we will often only prove the special case where G = SL(3,R). A reader 
familiar with the theory of “real roots” should have no difficulty generalizing 
the arguments.) 


(8.2.1) Proposition. Let A be a maximal R-split torus in G. Then we have 
rankpG > 2 if and only if there exist nontrivial elements a, and ag of A, 
such that G = (Cg(a1),Ca(az2)). 


8.2. GROUPS OF HIGHER REAL RANK 183 


Proof. (=) Assume, for simplicity, that G = SL(3,R). (See Exercise 1(a) 
for another special case.) Then we may assume A is the group of diagonal 
matrices (after replacing it by a conjugate). Let 


2 0 0O 1/4 0 0 
aj=|0 2 O and a=|0O 2 0 
0 0 1/4 0 O 2 
Then 
* * O x 0 0 
Ca(a1) = |* * O and Ce(az)=]0 * * 
0 O x QO « * 


These generate G. 
(<) Suppose ranky G = 1, so dim A = 1. Then, since A is almost Zariski 
closed (and contains (a1)), we have Cg(ai1) = Cg(A) = Ca(az), so 
(Ca(a1),Ca(a2)) = Ca(A). 
It is obvious that Cg(A) 4 G (because the center of G is finite, and therefore 
cannot contain the infinite group A). 


The following explicit description of Cg(A) will be used in some of the 
proofs. 


(8.2.2) Lemma. /f A is any maximal R-split torus in G, then Cg(A) = AxC, 
where C' is compact. 


Proof. (optional) A subgroup of SL(¢,R) is said to be reductive if it is 
isogenous to M x JT’, where M is semisimple and T is a torus. It is known 
that the centralizer of any torus is reductive (see Exercise 2), so, if we assume, 
for simplicity, that Cg(A) is connected, then we may write Cg(A) = M x A, 
where M is reductive (see Exercise 3). The maximality of A implies that MW 
does not contain any R-split tori, so M is compact (see Exercise 8.1#9). 


(8.2.3) Proposition (see Exercise 4). ranky G > 2 if and only if there exist 
a nontrivial hyperbolic element a and a nontrivial unipotent element u, such 
that au = ua. 


For use in the proof of the proposition that follows it, we mention a very 
useful characterization of a somewhat different flavor: 


(8.2.4) Lemma (see Exercise 5). ranky G < 1 if and only if every nontrivial 
unipotent subgroup of G is contained in a unique maximal unipotent sub- 
group. 


(8.2.5) Proposition. ranky G > 2 if and only if there exist nontrivial unipo- 
tent subgroups U;,...,Ux, such that 
e (Ui,...,UK) = G, and 


e U; centralizes Uj41 for each i. 
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Proof. (=) Assume, for simplicity, that G = SL(3,R). Then we take the 


sequence 
bio deo] ve oo rt bro] 
O010;}, JO1O], |O1*}, |* 10], JO10}, /O10]. 
001 001 001 001 * 01 Ox*1 
(<) Since U; commutes with U;41, we know that (U;,U;+1) is unipo- 
tent, so, if rankp G = 1, then it is contained in a unique maximal unipotent 
subgroup U; of G. Since U; and Ujar both contain U;:1, we conclude that 
U; = U;41 for all i. Hence, (U;,...,U,) is contained in the unipotent group 
U,, and is therefore not all of G. O 


(8.2.6) Remark. See Lemma 16.5.7 for yet another result of the same type, 
which will be used in the proof of the Margulis Superrigidity Theorem in 
Section 16.5. A quite different characterization, based on the existence of 
subgroups of the form SL(2, R) k R”, appears in Exercise 13.2#2, and is used 
in proving Kazhdan’s Property (TJ) in Chapter 13. 


We know that SL(2,R) is the smallest group of real rank one (see Exer- 
cise 8.14413). However, the smallest group of real rank two is not unique: 


(8.2.7) Proposition. Assume G is simple. Then rankg G > 2 if and only if 
G contains a subgroup that is isogenous to either SL(3,R) or SO(2, 3). 


Exercises for §8.2. 


#1. Prove the following results in the special case where G = G, Xx G2, and 
rankp G; > 1 for each i. 
a) Proposition 8.2.1(=) 
b) Proposition 8.2.3(=) 
c) Lemma 8.2.4(<) 
d) Proposition 8.2.5(=) 


#2. (optional) It is known that if M is a subgroup that is almost Zariski 
closed, and M7 = M, then M is reductive (cf. Corollary A7.8). As- 
suming this, show that if T is a subgroup of the group of diagonal 
matrices, and G? = G, then Cg(T) is reductive. 


#3. (optional) Suppose M is reductive, and A is an R-split torus in the 

center of M. Show there exists a reductive subgroup L of M°, such 
that M° =LxA. 
[Hint: Up to isogeny, write M = Mo x T, with A C T. Then it suffices to 
show T = E x A for some &. You may assume, without proof, that, since T 
is a connected, abelian Lie group, it is isomorphic to R™ x T” for some m 
and n.| 


#4. a) Prove Proposition 8.2.3(=) under the additional assumption that 
G =SL(8,R). 
b) Prove Proposition 8.2.3(<). 


#5. 


H6. 


HT. 
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Find a nontrivial unipotent subgroup of SL(3,R) that is contained in 
two different maximal unipotent subgroups. 


(Assumes the theory of real roots) Prove the general case of the following 
results. 

a) Lemma 8.2.2 

b) Proposition 8.2.1(=) 

c) Proposition 8.2.3(=>) 

d) Lemma 8.2.4 

e) Proposition 8.2.5(=) 


Show (without assuming G is simple): rankg G > 2 if and only if G con- 
tains a subgroup that is isogenous to either SL(3,R) or SL(2,R) x 
SL(2, R). 

[Hint: Proposition 8.2.7. You may assume, without proof, that SO(2, 2) is 
isogenous to SL(2,R) x SL(2, R).] 


§8.3. Groups of real rank one 


As a complement to Section 8.2, here is an explicit list of the simple groups 
of real rank one. 


(8.3.1) Theorem. Jf G is simple, and rankg G = 1, then G is isogenous to 


either 


(8.3.2) Remark. The special linear groups SL(2,R), SL(2,C) and SL(2, H) 


SO(1,n) for some n > 2, 

SU(1,n) for some n > 2, 

Sp(1,n) for some n > 2, or 

Bae (also known as Fy), a certain exceptional group. 


have real rank one, but they are already on the list under different names, 
because 


1) 
2) 
3) 


SL(2,R) is isogenous to SO(1,2) and SU(1, 1), 
SL(2, C) is isogenous to SO(1,3) and Sp(1,1), and 
SL(2, H) is isogenous to SO(1, 4). 


(8.3.3) Remark. Each of the simple groups of real rank one has a very im- 
portant geometric realization. Namely, SO(1,n), SU(1,n), Sp(1,n), and F441 
(respectively) are isogenous to the isometry groups of: 


1) 
2) 
3) 
4) 


(real) hyperbolic n-space 9”, 
complex hyperbolic n-space CH", 


quaternionic hyperbolic n-space HS", and 


the Cayley plane, which can be thought of as the hyperbolic plane 
over the (nonassociative) ring O of Cayley octonions. 
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§8.4. Minimal parabolic subgroups 


The group of upper-triangular matrices plays a very important role in the 
study of SL(n,R). In this section, we introduce subgroups that play the same 
role in other semisimple Lie groups: 


(8.4.1) Definition. Let A be a maximal R-split torus of G, and let a be a 
generic element of A, by which we mean that Cg(a) = Cg(A). Then the 
corresponding minimal parabolic subgroup of G is 


p={geG 


This is a Zariski closed subgroup of G. 


lim sup ||ja~" ga” || < oo } : (8.4.2) 
n> co 


(8.4.3) Theorem. All minimal parabolic subgroups of G are conjugate. 


(8.4.4) Examples. 
1) The group of upper triangular matrices is a minimal parabolic subgroup 
of SL(n, R). To see this, let A be the group of diagonal matrices, and 
choose a € A with a1,1 > d2,2 >--: > Ann > 0 (see Exercise 1). 


2) It is easier to describe a minimal parabolic subgroup of SO(1,7) if we 


replace Id,» with a different symmetric matrix of the same signature: 
let G = SO(A;R), for 


0 0 1 
A= {0 Id(n—1)x(n—1) 9 
1 0 


Then G is conjugate to SO(1,n) (see Exercise 4), the (1-dimensional) 
group of diagonal matrices in G form a maximal R-split torus, and a 
minimal parabolic subgroup in G is 


t * * 
0 SO(n-1) * 
0 0 1/t 


(see Exercise 2). 


The following result explains that a minimal parabolic subgroup of a 
classical group is simply the stabilizer of a (certain kind of) flag. Recall that 
a subspace W of a vector space V, equipped with a bilinear (or Hermitian) 
form (- | -), is said to be totally isotropic if (W |W) =0. 

(8.4.5) Theorem (see Exercise 3). 
1) A subgroup P of SL(n,R) is a minimal parabolic if and only if there is 


a chain Vo CVi © --: S Vn of subspaces of R” (with dim V; = 7), such 
that 


P={g€SL(n,R)|Vi, gVi = Vi }- 
Similarly for SL(n,C) and SL(n,H), taking chains of subspaces in C” 
or H”, respectively. 
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2) A subgroup P of SO(m,n) is a minimal parabolic if and only if there 
is a chain Vo CV, C--- CV, of totally isotropic subspaces of R™*" 
(with dim V; =i and r = min{m,n}), such that 

P={g€SO(m,n) | Vi, gV; = V; }. 
Similarly for SO(n, C), SO(n, H), Sp(2m,R), Sp(2m, C), SU(m, n) and 
Sp(m,n). 


Note that any upper triangular matrix in SL(n, R) can be written uniquely 
in the form mau, where 


e a belongs to the R-split torus A of diagonal matrices whose nonzero 
entries are positive, 


e misin the finite group M consisting of diagonal matrices whose nonzero 
entries are +1, and 


e u belongs to the unipotent group N of upper triangular matrices with 
1’s on the diagonal. 


The elements of every minimal parabolic subgroup have a decomposition of 
this form, except that the subgroup M may need to be compact, instead of 
only finite: 


(8.4.6) Theorem (Langlands decomposition). If P is a minimal parabolic 
subgroup of G, then we may write it in the form P = Cg(A)N = MAN, 
where 

e A is a maximal R-split torus, 

e M is a compact subgroup of Cg(A), and 

e N is the unique maximal unipotent subgroup of P. 


Furthermore, N is a maximal unipotent subgroup of G, and, for some generic 
a€é A, we have 


 — {u EG | lim a Ua =e \ : (8.4.7) 


Before discussing the proof (which is not so important for our purposes), 
let us consider a few examples: 


(8.4.8) Example. 


1) If G=SL(n, C), then, for the Langlands decomposition of the group P 
of upper-triangular matrices, we may let: 
e A be the group of diagonal matrices in G whose nonzero entries 
are positive real numbers (just as for SL(n,R)), 
e M be the group of diagonal matrices in G whose nonzero entries 
have absolute value 1, and 
e N be the group of upper triangular matrices with 1’s on the diag- 
onal. 
The same description applies to G = SL(n,H) (and, actually, also to 
SL(n, R)). 
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2) Assume m < m, and let G = SO(A;R), where 


0 0 diy 0 1 : 
A= 0 Id(n—m)x(n—m) 0 and a f = a | 
Jie 0 0 i 60 


(and the size of the matrix J, is m x m). Then G is conjugate to 
SO(m,n) (see Exercise 4), and a minimal parabolic P of G is: 


yee b € GL(m, R) is upper triangular, 
0 kx 
0 0 of k € SO(n —m) 


where 21 = Jm(x~!)" Jm, so, for example, 
diag(b1,...,0m)' = diag(1/bm,...,1/b1). 
Hence, we may let 
A= 4 diag ai: Gig One Oa ened ay) lag SO}, 
e M=SO(n—m) x {+1}™, and 
e N be the group of upper triangular matrices with 1’s on the diag- 
onal that are in G. 


Proof of Theorem 8.4.6 (optional). Choose a generic element a of A 
satisfying (8.4.2), and define N as in (8.4.7). Then, since a is diagonalizable 
over R, it is not difficult to see that P = Cg(a) N (see Exercise 5). Since a is 
a generic element of A, this means P = Cg(A) N. 

It is easy to verify that N is normal in P (see Exercise 6); then, since 
P/N & Cq@(A) = A x (compact) (see Lemma 8.2.2), and therefore has no 
nontrivial unipotent elements, it is clear that N contains every unipotent 
element of P. Conversely, the definition of N implies that it is unipotent 
(see Exercise 7). Therefore, N is the unique maximal unipotent subgroup 
Ore. 

Suppose U is a unipotent subgroup of G that properly contains N. Since 
unipotent subgroups are nilpotent (see Exercise 9), then Nu(N) properly 
contains N (see Exercise 10). However, it can be shown that Nc(N) = P 
(see Exercise 8), so this implies Ny(N) is a unipotent subgroup of P that 
properly contains N, which contradicts the conclusion of the preceding para- 
graph. LJ 


The subgroups A and N that appear in the Langlands decomposition of P 
are two components of the Iwasawa decomposition of G: 
(8.4.9) Theorem (Iwasawa decomposition). Let 

e K be a maximal compact subgroup of G, 

e A be a maximal R-split torus, and 

e N be a mazimal unipotent subgroup that is normalized by A. 
Then G= KAN. 
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In fact, every g € G has a unique representation of the form g = kau 
wihk Ee K,a€ A, anducN. 


(8.4.10) Remark. A stronger statement is true: if we define a function y: K x 
Ax N > G by o(k,a,u) = kau, then ¢ is a (real analytic) diffeomorphism. 
Indeed, Theorem 8.4.9 tells us that y is a bijection, and it is obviously real 
analytic. It is not so obvious that the inverse of y is also real analytic, but this 
is proved in Exercise 7.143 when G = SL(n,R), and the general case can be 
obtained by choosing an embedding of G' in SL(n, R) for which the subgroups 
K, A, and N of G are equal to the intersection of G with the corresponding 
subgroups of SL(n, R). 


The Iwasawa decomposition implies KP = G (since AN C P), so it has 
the following important consequence: 


(8.4.11) Corollary. If P is any minimal parabolic subgroup of G, then G/P 
is compact. 


(8.4.12) Remark. A subgroup of G is called parabolic if it contains a minimal 
parabolic subgroup. 


1) Corollary 8.4.11 implies that if Q is any parabolic subgroup, then G/Q 
is compact. The converse does not hold. (For example, if P = MAN 
is a minimal parabolic, then G/(AN) is compact, but AN is not para- 
bolic unless M is trivial.) However, passing to the “complexification” 
does yield the converse: Q is parabolic if and only if Gc/Qc is com- 
pact. Furthermore, Q is parabolic if and only if Qc contains a maximal 
solvable subgroup (“Borel subgroup”) of Gc. 


2) All parabolic subgroups can be described fairly completely (there are 
only finitely many that contain any given minimal parabolic), but we 
do not need the more general theory. 


Exercises for §8.4. 


#1. Let a be a diagonal matrix as described in Example 8.4.4(1), and show 
that the corresponding minimal parabolic subgroup is precisely the 
group of upper triangular matrices. 


#2. Show that the subgroup at the end of Example 8.4.4(2) is indeed a 
minimal parabolic subgroup of SO(A;R). 


#3. Show the minimal parabolic subgroups of each of the following groups 
are as described in Theorem 8.4.5: 
a) SL(n,R). 
b) SO(m, n). 
[Hint: It suffices to find one minimal parabolic subgroup in order to under- 
stand all of them (see Theorem 8.4.3).] 


190 


HEA. 


#5. 


#6. 


HT. 


HE, 


#9. 


410. 


#11. 
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For A as in Example 8.4.8(2), show that SO(A;R) is conjugate to 
SO(m, n). 

[Hint: Let a = 1/\/2, and define v; to be: a(e; + en41-i) for i < m, e; for 
m <i<n, and a(e; — eny1-i) for i > n. Then v/ Av; is 1 for i <n, and is 
—lfori>n.] 

(optional) For P, a, and N as in the proof of Theorem 8.4.6, show 
P= Ca(a) N. 

[Hint: Given g € P, show that a~"ga” converges to some element c of Cg(a). 
Also show c-'g € N. You may assume a is diagonal, with a11 > a22 >--- > 


ace (why?).| 
For P, a, and N as in the proof of Theorem 8.4.6, show N is a normal 
subgroup of P. 


Show that a subgroup WN satisfying (8.4.7) must be unipotent. 


[Hint: u has the same characteristic polynomial as a~"ua”.] 


For P and N as in Theorem 8.4.5(2), show P = No(N). 
(Hint: P is the stabilizer of a certain flag, and the subgroup N also uniquely 
determines this same flag.] 
Show that every unipotent subgroup of SL(@,R) is nilpotent. (Recall 
that a group N is nilpotent if there is a series 
{e} = Nod-:-IN-=N 
of subgroups of N, such that [N,N] C Nz—1 for each k.) 
[Hint: Engel’s Theorem (A5.7).] 


Show that if N is a proper subgroup of a nilpotent group U, then 


Nu(N) EN. 
[Hint: If [N,Uz] C N, then U; normalizes N.| 


Assume K is a maximal compact subgroup of G. Show: 
a) G is diffeomorphic to the cartesian product kK x R”, for some n, 
b) G/K is diffeomorphic to R”, for some n, 
c) G is connected if and only if kK is connected, and 
d) G is simply connected if and only if K is simply connected. 
[Hint: Remark 8.4.10.] 


Notes 


The comprehensive treatise of Borel and Tits [1] is the standard reference 
on rank, parabolic subgroups, and other fundamental properties of reductive 
groups over any field. See [5, 87.7, pp. 474-487] for a discussion of parabolic 
subgroups of Lie groups (which is the special case in which the field is R). 

Remark 8.1.9(3) is due to Prasad-Raghunathan [7, Thms. 2.8 and 3.9]. 

Proofs of the Iwasawa decomposition for both SL(n,R) (7.1.1) and the 
general case (8.4.9) can be found in [6, Prop. 3.12, p. 129, and Thm. 3.9, 
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p. 131]. (Iwasawa’s original proof is in [4, §3].) The decomposition also 
appears in many textbooks on Lie groups. In particular, Remark 8.4.10 is 
proved in [3, Thm. 6.5.1, pp. 270-271]. 

Regarding Remark 8.4.12(1), the obvious cocompact subgroups of G' are 
parabolic subgroups and (cocompact) lattices. See [8] for a short proof that 
every cocompact subgroup is a combination of these two types. (A similar 
result had been proved previously in [2, (5.1a)].) 
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Chapter 9 


Q-Rank 


Algebraically, the definition of real rank extends in a straightforward way to 
a notion of rank over any field: if G is defined over F’, then we can talk about 
rankr G. In the study of arithmetic groups, we assume G is defined over Q, 
and the corresponding Q-rank is an important invariant of the associated 
arithmetic group [ = Gz. 

Disclaimer. The reading of this chapter may be postponed without severe conse- 
quences (and can even be skipped entirely), because the material here will not arise 
elsewhere in this book (except marginally) other than in Chapter 19, where a coarse 
fundamental domain for I will be constructed. Furthermore, unlike the other chap- 
ters in this part of the book, the topic is of importance only for arithmetic groups 
and closely related subjects, not a broad range of areas of mathematics. 


§9.1. Qrank 


(9.1.1) Definition. Assume G is defined over Q. A closed, connected sub- 
group T of G is a Q-split torus if 


e T is defined over Q, and 


e T is diagonalizable over Q. (That is, there exists g € GL(,Q), such 
that gTg~* consists entirely of diagonal matrices.) 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. _ http: //creativecommons.org/publicdomain/zero/1.0/ 


Ma rank and minimal parabolic 
subgroups (Chapter 8), and groups defined over Q (Definition 5.1.2). 
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(9.1.2) Example. 
1) SO(1,1)° is a Qsplit torus, because gSO(1,1)g~! consists of diagonal 


matrices if g = i x ‘ 


2) Although it is obvious that every Q-split torus is an R-split torus, the 
converse is not true (even if the torus is defined over Q). For example, 
let T = SO(x? — 2y?;R)°. Then T is defined over Q, and it is R-split 
(because it is conjugate to SO(1,1)°). However, it is not Q-split. To 


see this, note that 3 4] € To, but the eigenvalues of this matrix are 


irrational (namely, 3422), so this rational matrix is not diagonalizable 
over Q. 


The following key fact implies that the maximal Q-split tori of G all have 
the same dimension (which is called the “Q-rank” ): 


(9.1.3) Theorem. Assume G is defined over Q. If S; and Sz are maximal 
Q-split tori in G, then S, = gS2g~' for some g € Gg. 
(9.1.4) Definition (for arithmetic lattices). Assume 
e G is defined over Q, and 
e I’ is commensurable to Gz. 
Then rankg I is the dimension of any maximal Q-split torus in G. 
(More generally, if ¢: G/K + G'/K’, where K and K’ are compact, 


and (I) is commensurable to G%, (see Definition 5.1.19), then rankg TI is the 
dimension of any maximal Q-split torus in G’.) 


(9.1.5) Examples. 


1) rankg(SL(n, Z)) = n—1. (Let S be the identity component of the 
group of all diagonal matrices in SL(n, R).) 


2) Let G = SO(Q;R), where Q(x1,..., x) is some quadratic form on R*, 
such that Q is defined over Q. (That is, all of the coefficients of Q 
are rational.) Then G is defined over Q, and the discussion of Exam- 
ple 8.1.7, with Q in place of R, shows that rankg Gz is the maximum 
dimension of a totally isotropic Q-subspace of Q*. 

(a) For example, rankg SO(m,n)z = min{m,n}. Similarly, 


rankg SU(m, n)z = rankg Sp(m,n)z = min{m, n}. 
So rankg Gz = rank G for these groups. 

(b) Let G = SO(a? +23 +23 —727;R). Then, because the congruence 
a? +b? +c? +d? = 0 (mod 8) implies that all the variables are even, 
it is not difficult to see that this quadratic form has no nonzero 
isotropic vectors in Q* (see Exercise 4). This means rankg Gz = 0. 

Note, however, that G is isomorphic to SO(3,1), so its real 
rank is 1. Therefore, rankg Gz # rankg G. 
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3) rankgT = 0 if and only if G/T is compact (see Exercise 5). 

4) rankgSU(B,7;Op) is the dimension (over D) of a maximal totally 
isotropic subspace of D”, if B is a r-Hermitian form on D”, and D is 
a division algebra over F’. 


(9.1.6) Warning. In analogy with Exercise 8.1413 and Exercise 5.3#7(e), 
one might suppose that rankg I ¥ 0 if and only if I contains a subgroup that 
is isomorphic to SL(2, Z) (modulo finite groups). However, this is false: every 
lattice in G contains a subgroup that is abstractly commensurable to SL(2, Z) 
(unless G is compact). Namely, the Tits Alternative tells us that [ contains 
a nonabelian free subgroup (see Corollary 4.9.2), and it is well known that 
SL(2, Z) has a finite-index subgroup that is free (see Exercise 4.945). 


(9.1.7) Remarks. 


1) The definition of rankg I is somewhat indirect, because the Q-split tori 
of G are not subgroups of [’. Therefore, it would be more correct to say 
that we have defined rankg Gg. 


2) Although different embeddings of G in SL(¢, IR) can yield maximal Q- 
split tori of different dimensions, the theory of algebraic groups shows 
that the Q-rank is the same for all of the embeddings in which IT is 
commensurable to Gz (see Corollary 9.4.7); therefore, rankgI is well 
defined as a function of I. 


3) We have 0 < rankgT < rankg G, since every Q-split torus is R-split. It 

can be shown that: 

(a) The extreme values are always realized: there exist lattices To 
and I, in G, such that rankglo = O and rankgI, = rankyp G 
(see Theorem 18.7.1 and Exercise 7). 

(b) In some cases, there are intermediate values that are not realized. 
For example, the Q-rank of every lattice in SO(2, 5) is either 0 or 2 
(see Corollary 18.6.2). 


4) Suppose I is defined by restriction of scalars (5.5.8), so [ is commen- 
surable to Go, where G’ is defined over a finite extension F of Q, and 
O is the ring of integers of F. Then rankgT is equal to the “F-rank” 
of G’, or, in other words, the maximal dimension (over F’,) of a sub- 
group of G’ that is diagonalizable over F’. For example, the Q-rank of 
SO(B; QO) is the dimension of a maximal totally isotropic F'-subspace 

of F”. 
Definition 9.1.4 applies only to arithmetic lattices, but the Margulis Arith- 
meticity Theorem (5.2.1) allows the definition to be extended to all lattices: 


(9.1.8) Definition (see Exercise 6). Up to isogeny, and modulo the maximal 
compact factor of G, we may write G = G; x --- x G,, so that [; =G; is 
an irreducible lattice in G; for i= 1,...,r (see Proposition 4.3.3). We let 


rankg(C) = rankg(T1) + --- + rankg(Ts), 
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where: 
1) If G/T; is compact, then rankg IT; = 0. 
2) If G/T; is not compact, and rankg G = 1, then rankgT; = 1. 
3) If G/T; is not compact, and rankg G > 2, then the Margulis Arithmetic- 


ity Theorem (5.2.1) implies that [; is arithmetic, so Definition 9.1.4 
applies. 


Exercises for §9.1. 


#1, 
#2. 


#3. 


4A, 


#5. 


#6. 


#7. 


Show that if T is a Q-split torus, then Tz is finite. 


Give an example of a torus T (that is defined over Q), such that Tz is 
infinite. 


Verify the claim in Example 9.1.5(2) that rankg SO(Q; Z) is the dimen- 
sion of a maximal totally isotropic subspace of Q*. 


Verify the claim in Example 9.1.5(2b) that (0, 0,0, 0) is the only solution 
in Q* of the equation x? + x3 + x2 — 7x3 = 0. 


Prove Example 9.1.5(3). 
[Hint: (=) See Exercise 5.3#7. (<) If a is diagonalizable over Q, then there 
exists v € Z*, such that a"v + 0 as n > +00, so the Mahler Compactness 
Criterion (4.4.7) implies G/Gz is not compact.| 
Show that Definition 9.1.8 is consistent with Definition 9.1.4. More 
precisely, assume I is arithmetic, and prove: 

a) G/T is compact if and only if rankgT = 0. 

b) If G/T is not compact, and rankg G = 1, then rankgT = 1. 

c) If f =1y x [2 is reducible, then rankg [ = rankg Ty + rankgI2. 


Suppose G is classical. Show that, for the natural embeddings described 
in Examples A2.3 and A2.4, we have rankg Gz = rankr G. 
[Hint: Example 9.1.5(1,2)).] 


§9.2. Lattices of higher Q-rank 


This section closely parallels Section 8.2, because the results there on semisim- 
ple groups of higher real rank can be extended in a natural way to lattices of 
higher Q-rank. 


(9.2.1) Assumption. Throughout this section, if the statement of a result 
mentions Gg, Gz, or a Q-split torus in G, then G is assumed to be defined 
over Q. 


(9.2.2) Proposition (see Exercise 1). Let S be any maximal Q-split torus 


inG. 


Then we have rankg Gz > 2 if and only if there exist nontrivial elements 


81 and 82 of Sg, such that G = (Ce(s1),Ca(s2))- 
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(9.2.3) Lemma. If S is any mazimal Q-split torus in G, then we have 
Ca(S)=Sx M=SxCL, where 


e M,C, and L are defined over Q, 
e rankg M =0, 
e L is semisimple, and 


e C is a torus that is the identity component of the center of M. 


(9.2.4) Proposition (see Exercise 3). rankg Gz > 2 if and only if there exist 
nontrivial elements a and u of Gg, such that a belongs to a Q-split torus of G, 
u is unipotent, and au = ua. 


(9.2.5) Lemma (see Exercise 4). Assume [ is commensurable to Gz. The 
following are equivalent: 


1) rankgI <1. 


2) Every nontrivial unipotent subgroup of T is contained in a unique 
maximal unipotent subgroup of I. 


3) Every nontrivial unipotent Q-subgroup of G is contained in a unique 
maximal unipotent Q-subgroup of G. 


(9.2.6) Proposition. rankgIT > 2 if and only if T contains nontrivial unipo- 
tent subgroups U;,...,Ux, such that 


e (U;,...,Ux%) is a finite-index subgroup of T, and 


e U; centralizes U;+1 for each i. 


(9.2.7) Proposition. Assume I is irreducible. Then rankgI > 2 if and 
only if T contains a subgroup that is commensurable to either SL(3,Z) or 
SO(2, 3)z. 


(9.2.8) Remarks. 


1) Unfortunately, the list of lattices of Q-rank one is longer and much 
more complicated than the list of simple groups of real rank one in 
Theorem 8.3.1. The classical arithmetic groups (of any Q-rank) are de- 
scribed in Chapter 18 (see the table on 380), but there are also infinitely 
many different lattices of Q-rank one in exceptional groups of type E¢ 
and Fy, and the nonarithmetic lattices of Q-rank one in SO(1,n) and 
SU(1, 7) have not yet been classified. 


2) Suppose rankgI < 1. Proposition 9.2.6 shows that it is impossible to 
find a generating set {71,...,7,} for , such that each y; is nontrivial 
and unipotent, and y; commutes with y;;1, for each 7. However, it is 
possible, in some cases, to find a generating set {71,...,7,} that has 
all of these properties except the requirement that 7; is unipotent. For 
example, this is easy (up to finite index) if [is reducible (see Exercise 7). 
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Exercises for §9.2. 


#1. 


4.2 


#3. 


4A, 


#5. 
#6. 


#7. 


#8. 


a) Prove Proposition 9.2.2(=) for the special case where we have 
Gg = SL(3, Q). 
b) Prove Proposition 9.2.2(<). 


Prove the following results in the special case where [ =I, x Tg, and 
rankg I; > 1 for each 2. 
a) Proposition 9.2.2(=) 
) Proposition 9.2.4(=) 
c) Lemma 9.2.5(<) 
d) Proposition 9.2.6(=) 
) 


a) Prove Proposition 9.2.4(=) in the special case where we have Gg = 


SL(3, Q). 
b) Prove Proposition 9.2.4(<). 


For each of these groups, find a nontrivial unipotent subgroup that is 
contained in two different maximal unipotent subgroups. 

a) SL(3, Q). 

b) SL(8, Z). 


Prove Proposition 9.2.6. 


(Assumes the theory of Q-roots) Prove the general case of the following 
results. 
a) Proposition 9.2.2. 
) Lemma 9.2.3. 
c) Proposition 9.2.4(=). 
) Lemma 9.2.5. 
) Proposition 9.2.6. 
f) Proposition 9.2.7. 


Show that if is reducible, and G has no compact factors, then there 
is a finite subset {71,...,7,} of P', such that 

a) {y1,---,7r} generates a finite-index subgroup of I, 

b) each y; is nontrivial, and 

c) 7¥, commutes with 7:41, for each 7. 


Let [ be a torsion-free, cocompact lattice in SL(3, R), constructed as in 
Proposition 6.7.4. Show that if y, and y2 are any nontrivial elements 
of I’, such that 7, commutes with 7, then Cr(71) = Cr(y2). (Hence, it 
is impossible to find a sequence of nontrivial generators of I’, such that 
each generator commutes with the next.) 

[Hint: Let D = ¢(L?), so D is a division ring of degree 3 over Q. Then 


Cp(71) is subring of D that contains the field Q[71] in its center. Because the 
degree of D is prime, we conclude that Cp(y1) = Qi] © Cpo(72).] 
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§9.3. Minimal parabolic Q-subgroups 


Minimal parabolic subgroups of G play an important role in the study of 
arithmetic subgroups, even when they are not defined over Q. However, for 
some purposes (especially when we construct a coarse fundamental domain in 
Chapter 19), we want a subgroup that is both defined over Q and is similar 
to a minimal parabolic subgroup: 


(9.3.1) Definition (cf. Definition 8.4.1). Let S be a maximal Q-split torus 
of G, and let a be a generic element of S. Then the corresponding minimal 
parabolic Q-subgroup of G is 


PS {9 eG | lim sup ||a~" ga" || < co } : 
NCO 
This is a Zariski closed subgroup of G that is defined over Q. 


(9.3.2) Examples. 


1) Since the group of upper triangular matrices is a minimal parabolic Q- 
subgroup of SL(n,R), we see that, in this case, the minimal parabolic 
Q-subgroup is also a minimal parabolic subgroup. 


2) This is a special case of the fact that if rankg Tl = rankg G, then every 
minimal parabolic Q-subgroup is also a minimal parabolic subgroup 
(see Exercise 1). 


3) (Cf. Theorem 8.4.5(2)) Suppose Q is a nondegenerate quadratic form 
on Q* that is defined over Q. A subgroup P of SO(Q;R) is a minimal 
parabolic Q-subgroup if and only if there is a chain Vo € Vi € --: CV, 
of totally isotropic subspaces of Q*, such that 

e dim V; = 7, for each i, 
e V,. is a maximal totally isotropic subspace, and 
e P={g€SO(Q;R) | Vi, gVi = Vi }- 


We have a Langlands decomposition over Q. However, unlike in the real 
case, where the subgroup M is compact (i.e., rankg M = 0), we now have a 
subgroup that may be noncompact (but whose Q-rank is 0): 


(9.3.3) Theorem (Langlands decomposition). Jf P is a minimal parabolic 
Q-subgroup of G, then we may write P in the form P = MSN = LCSN, 
where 


1) M, S, N, L, and C are defined over Q, 


S is a maximal Q-split torus, 


M = LC, where L is semisimple and C' is the identity component of 
the center of M, and 
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6) N is the unipotent radical of P; that is, the unique maximal unipotent 
normal subgroup of P. 


Furthermore, for some a € Sg, we have 


P= {9 €G| limsup ||a~"ga"|| < oo (9.3.4) 
noo 
and 
N= {9 EG | lim a Ga. =e \ ; (9.3.5) 


Proof. The examples and proof are essentially the same as for the real Lang- 
lands decomposition (8.4.6), but with Q in the place of R, and groups of 
Q-rank 0 in place of compact groups. O 


(9.3.6) Proposition. Assume G is defined over Q, and P is a minimal par- 
abolic Q-subgroup, with Langlands decomposition P = MSN. Then: 
1) Every minimal parabolic Q-subgroup of G is Gg-conjugate to P. 
2) Every unipotent Q-subgroup of G is Gg-conjugate to a subgroup of N. 
3) P=Ne(N) = Na(P). 


(9.3.7) Definition. For P, M, S, N, L, and C as in Theorem 9.3.3, the 
positive Weyl chamber of S (with respect to P) is the set S* of all el- 
ements a of S, such that P is contained in the right-hand side of (9.3.4). 
(Equivalently, it is the closure of the set of elements a of S for which equality 
holds in (9.3.4).) 


Exercises for §9.3. 


##1. Show that if we have rankg I = rankg G, then every minimal parabolic 
Q-subgroup is also a minimal parabolic subgroup. 


[Hint: Choose A to be both a maximal Q-split torus and a maximal R-split 
torus. | 


#2. Show that the converse of Exercise 1 is not true. 
[Hint: Proposition 6.6.1.] 


#3. Show that every minimal parabolic Q-subgroup of G contains a minimal 
parabolic subgroup. 
[Hint: Choose a maximal Q-split torus S. Then choose a maximal R-split 
torus A that contains S. There is a generic element of A that is very close to 
a generic element of S.] 

#4. If P is any minimal parabolic Q-subgroup of G, and K is any maximal 
compact subgroup of G, show that G = KP. 


[Hint: The Iwasawa decomposition (8.4.9) tells us G = KAN, and some 
conjugate of AN is contained in P.| 


#5. Assume the notation of Theorem 9.3.3. Show that if rankg G = 1, then 
there is an isomorphism ¢: S > R, such that OS?) = 
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#6. Show that if U is a unipotent Q-subgroup of SL(@,R), then Uz is a 
cocompact lattice in U. 


(Hint: Induct on the nilpotence class of U (see Exercise 8.449). Note that 
the exponential map exp: u —> U is a polynomial with rational coefficients, 
as is its inverse, so Uz is Zariski dense in U.] 


#7. Show that if U; and U2 are maximal unipotent subgroups of I’, and I is 
commensurable to Gz, then there exists g € Gg, such that g~'Ujg is 
commensurable to U9. 


§9.4. Isogenies over Q 


We have seen examples in which G' is isogenous (or even isomorphic) to G’, 
but the arithmetic subgroup Gz is very different from GZ. (For example, it 
may be the case that G'z is cocompact, but G% is not.) This does not happen 
if the isogeny is defined over Q, in the following sense: 


(9.4.1) Definition. 
1) A homomorphism $: G — G" is defined over Q if ¢(GQ) C Gg. 


2) G, is tsogenous to G2 over Q (denoted G, ¥g G2) if there is a 
group G that is defined over Q, and isogenies ¢;: G — G; that are 
defined over Q. 


The following result shows that any isogeny over Q can be thought of as 
a polynomial with rational coefficients. 


(9.4.2) Definition. A function ¢: G > G’ is a polynomial with rational 
coefficients if 


e the matrix entries of ¢(g) can be written as polynomial functions of the 
coefficients of g, and 


e the polynomials can be chosen so that all of their coefficients are in Q. 


(9.4.3) Proposition. If G,; +g G2, then there is a group G that is defined 
over Q, and isogenies ¢;: G > G; for i = 1,2, that are polynomials with 
rational coefficients. 


Proof. Given isogenies ¢;: G — G; that are defined over Q, let 


G'= { (¢1(g), ¢2(9)) | geG i‘. 
This is defined over Q, since Gg is dense (see Proposition 5.1.8). The pro- 
jection maps ¢,: G’ — G; defined by ¢/(g1,92) = gi are polynomials with 
rational coefficients. 
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(9.4.4) Warning. There are examples where ¢: G1 — G2 is an isomorphism, 
and ¢ is a polynomial, but @~! is not a polynomial. For example, the nat- 
ural homomorphism ¢: SL(3,R) — PSL(3,R)° is an isomorphism (because 
SL(3, R) has no center). However, there is no isomorphism from PSL(3, C) to 
SL(3,C) (because one of these groups has a center and the other does not), 
so the inverse of ¢ cannot be a polynomial (because it does not extend to a 
well-defined map between the complexifications). 


The following fundamental result implies that different embeddings of G 
with the same Q-points have essentially the same Z-points. 


(9.4.5) Proposition. Suppose ¢: G > G’ is a surjective homomorphism that 
is defined over Q. Then (Gz) is commensurable to Gf. 


Proof. From the proof of Proposition 9.4.3, we see that, after replacing G 
with an isogenous group, we may assume that ¢ is a polynomial with rational 
coefficients. Assume G C SL(¢, R) and G’ C SL(m, R). 

Define ¢: G 3 Matmxm(R) by (zx) = (a —Id). Then @ is a polyno- 
mial, so it is defined on all of Mat,,,(R). Since the coefficients are in Q, 
there is some nonzero n € N, such that o(n Mat (R)) C Matmxm(Z). 
Therefore, letting [,, be the “principal congruence subgroup” of Gz of level n 
(see page 68), we have ¢(T,) C Gz. 

Because I, is a lattice in G (and $(T,,) is discrete), we know that $(T,,) 
is a lattice in G’. Since ¢([,) C Gz, this implies that (I) is commensurable 
to GZ (see Exercise 4.1410). oO 


The following fundamental fact is, unfortunately, not obvious from our 
definition of “Q-split” 
(9.4.6) Proposition. Assume 

e T and H are connected Lie groups that are defined over Q, and 
Then T is a Q-split torus if and only if H is a Q-split torus. 


(9.4.7) Corollary. If G +g G’, then rankg Gz = rankg Gz. 


Proof. Suppose G is a Q- group, and there is an isogeny y;: G > G; that 
is defined over Q for 7 = 1,2. If J) is a maximal Q-split torus in G,, then 
Proposition 9.4.6 implies that ye (yy, '(T1)°) is a Qsplit torus in G2. Since 
isogeny preserves dimension, we conclude that rankgG,; < rankgG»2. By 
symmetry, equality must hold. L] 


Notes 


As was mentioned in the notes of Chapter 8, the comprehensive treatise of 
Borel and Tits [3] is the standard reference on rank, parabolic subgroups, and 
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other fundamental properties of reductive groups over any field (including Q). 
Abbreviated accounts can be found in many texts, including [1, §10 and §11] 
and [2, Chap. 5]. 

See [1, Rem. 8.11, p. 60] for a proof of Proposition 9.4.5. 
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Chapter 10 


Quasi-lsometries 


§10.1. Word metric and quasi-isometries 


The field of Geometric Group Theory equips groups with a metric, which 
allows them to be studied as metric spaces: 


(10.1.1) Definition. Fix a finite generating set S of I (see Theorem 4.7.10), 
and assume, for simplicity, that S is symmetric, which means s~! € S for 
every sE S. 
1) For g ET, the word length of g is the length ¢ of the shortest sequence 
(51, S2,...,5¢) of elements of S, such that s;59---s~; = g. It is denoted 
€(g). (By convention, £(e) = 0.) 
2) For g,h € T, we let d(g,h) = €(g~th). This defines a metric on I, 
called the word metric (see Exercise 1). 


The word metric has the important property that the action of I on itself 
by left-translations is an action by isometries (see Exercise 2). 


(10.1.2) Remark. The word metric can be pictured geometrically, by con- 
structing a Cayley graph. Namely, Cay(I;S) is a certain 1-dimensional 
simplicial complex (or “graph” ): 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [ is a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. _ http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: none. 
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e its 0-skeleton is I’, and 
e it has a 1-simplex (or “edge”) of length 1 joining v to vs, for every v € [ 
ands €S. 
Define a metric on Cay(T; $) by letting d(x, y) be the length of the shortest 
path from x to y. Then the restriction of this metric to the 0-skeleton is 
precisely the word metric on I. 


Unfortunately, the word metric on I’ is not canonical, because it depends 
on the choice of the generating set S' (see Exercise 3). However, it is “almost” 
well-defined, in that changing the generating set can only distort the distances 
by a bounded factor. This idea is formalized in the following notion: 


(10.1.3) Definition. Let X; and X2 be metric spaces, with metrics d; and d2, 
respectively. 
1) A function f: X; > X2 is a quasi-isometry if there is a constant 
C' > 0, such that 
(a) for all x,y € X, with d,(z,y) > C, we have 
1 2 do(f(z), f(y)) 
C di (x,y) 


a, 


and 
(b) for all v2 € X92, there exists 7; € X1, such that 
dy (f (x1), x2) <C, 
Note that f need not be continuous (and need not be one-to-one or 
onto, either). 
2) We say Xj is quasi-isometric to X2 (and write X41 ns X2) if there 
is a quasi-isometry from X, to X2. This is an equivalence relation 
(see Exercise 4). 


(10.1.4) Proposition (see Exercise 5). Let 
e S; and S_ be two finite, symmetric generating sets for Tl, and 
e d; be the word metric on T corresponding to the generating set S;. 


Then (I, d1) © (1, dz). 


Therefore, if ; and Ig are quasi-isometric for some choice of the word 
metrics on the two groups, then they are quasi-isometric for all choices of the 
word metrics. So it makes sense to say that two groups are quasi-isometric, 
without any mention of generating sets (as long as both of the groups are 
finitely generated). 


(10.1.5) Remark. A property is said to be geometric if is is invariant under 
quasi-isometry. For example, we will see (in Proposition 12.7.22 and Re- 
mark 13.4.10, respectively) that amenability is a geometric property, but 
Kazhdan’s property (1) is not. In other words, if Aj aS As, and A, is 
amenable, then Az is amenable, but the same cannot be said for Kazhdan’s 
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property (7). In general, quasi-isometric groups can be very different from 
each other, so most of the usual algebraic properties of groups are not geo- 
metric. 


Quasi-isometries also arise from cocompact actions. Before stating the 
result, we introduce some terminology. 


(10.1.6) Definition. Let (X,d) be a metric space, and let C > 0. 
1) X is proper if the closed ball B,(x) is compact, for all r > 0 and all 


rEX. 
2) Let x,y € X. A (C)-coarse geodesic from x to y is a finite sequence 
Dy, Lisios soe x, Such that oo = 7, es, = and 


|d(x;, 25) — |i — j|| <0 for all7; 4. 
3) X is (C-)coarsely geodesic if, for all x,y € X, there is a C-coarse 
geodesic from x to y. 


(10.1.7) Proposition (see Exercise 6). Suppose 

e (X,d) is a metric space that is proper and coarsely geodesic, 

e I has a properly discontinuous action on X by isometries, such that 

T'\X is compact, and 

e d’ is a word metric on T. 
Then (I, d’) © (X,d). 

More precisely, for any basepoint x9 € X, the map y > yao is a quasi- 
isometry from T to X. 


(10.1.8) Corollary. If G/T is compact, then the inclusion T > G is a quasi- 
isometry, where we use any word metric on T, and we use any (coarsely 
geodesic, proper) metric on G that is invariant under left-translations. 


This implies that any two cocompact lattices in the same group are quasi- 
isometric: 


(10.1.9) Corollary. If [, and [2 are cocompact lattices in G, then 1; ~ To. 


Proof. We have I; GS Ts, sol, ary by transitivity. 


We will see in Section 15.4 that the situation is usually very different 
for lattices that are not cocompact: in most cases, there are infinitely many 
different (noncocompact) lattices in G that are not quasi-isometric to each 
other. 

Any (coarsely geodesic, proper) metric on G provides a metric on I’, by 
restriction. In most cases, this restriction is the word metric (up to quasi- 
isometry): 


(10.1.10) Theorem (Lubotzky-Mozes-Raghunathan). Jf rankg G > 2, and 
T is irreducible, then the inclusion T 4 G is a quasi-isometry onto its image. 
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The assumption that rankp G > 2 is essential: 


(10.1.11) Example. Let 
e G=SL(2,R), 
e I bea free subgroup of finite index in SL(2, Z) (see Exercise 4.945), 


Then: 


1) For any word metric dp on I, the function dp(u",e) grows linearly 
with n, because I is free. 

2) For any left-invariant metric dg on G, the function dg(u”, e) grows only 
logarithmically, because a!°2"uaq 8" = u2”. 


Therefore, the restriction of dg to T is not quasi-isometric to dp. 


Exercises for §10.1. 


#1. Show that the word metric is indeed a metric. More precisely, for 
z,y,z €T, show 
d(x,y) > 0, deg) =O: > eS y, 
d(x,y) = d(y, x), d(x,y) < d(x, z) + d(z,y). 


#2. Assume d is a word metric on TI (with respect to a finite, symmetric 
generating set S'). Show that d(ax,ay) = d(z,y) for alla,z,y ET. 
[Warning: d(xa, ya) is usually not equal to d(z, y).] 


#3. Assume [ is infinite. Show there exist two word metrics d; and dy 
on [ (corresponding to finite, symmetric generating sets S; and S2), 
such that the metric space (I'1,d,) is not isometric to the metric space 
(T2, dz). 


[Hint: A ball of radius r can have different cardinality for the two metrics.] 
#4. Show that © is an equivalence relation. 


#5. Prove Proposition 10.1.4. 
[Hint: Show the identity map is a quasi-isometry from (I, di) to (I, dz), by 
choosing C’ so that di(e,s) < C for each s € 5%. 

#6. Prove Proposition 10.1.7. 


[Hint: Assume SD {y]| da € X, d(yx,x) < 3C}, andT’- Bc(x) = X for all 
xe xX. Given 20,%1,...,2n € X with d(x, 2141) < C, there exists y; € T, 
such that d(yi20, ri) < C, so €(¥n) < n.] 
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§10.2. Hyperbolic groups 


Manifolds of negative sectional curvature play an important role in differen- 
tial geometry, both in applications and as a source of examples. The triangles 
in these manifolds have a very special “thinness” property that we will now 
explain. Groups whose triangles have this same property are said to be “neg- 
atively curved? or, in our terminology, “Gromov hyperbolic? 


(10.2.1) Definition (Gromov). Let 6 > 0, and let X be a C-coarsely geodesic 
metric space. 


1) A (C-coarse) triangle abc in X is a set {Sap, Sdc, Sac}, Where Spy is a 
C-coarse geodesic from x to y for x,y € {a,b,c}. 


2) A triangle abc is 6-thin if each of the three sides of the triangle is 
contained in the (closed) d-neighborhood of the union of the other two 
sides. That is, each point in sg, is at distance no more than 6 from 
some point in sg. U sy-, and similarly for sg. and Spe. 


3) X is Gromov hyperbolic if there exists some 6 > 0, such that every 
(C-coarse) triangle in X is 6-thin. 


(10.2.2) Theorem. The universal cover of any compact manifold of strictly 
negative sectional curvature is Gromov hyperbolic. 


Idea of proof. As an illustration, let us show that the hyperbolic plane $? 
is Gromov hyperbolic. We use the disk model. 

Any three distinct points a, b, and c on 
OS)” are the vertices of an ideal triangle (with 
geodesic sides). Choose a point p on ab. Since 
the geodesic ray pa is asymptotic to cd, there 
is some 6 > 0, such that every point of pa is? <3 
in the 5-neighborhood of cé. Similarly, every | 
point of pb is in the 6-neighborhood of zi (after | 
we enlarge 6). Therefore, all of ab is in the ‘' 
d-neighborhood of the union of the other two 
sides. By applying the same argument to the : 
sides bc and Gé, we see there is some 6, such 
that the triangle abc is 6-thin. 

Since the isometry group SL(2,R) acts transitively on the (unordered) 
triples of distinct points on the boundary, we conclude that every ideal triangle 
is 6-thin for this same value of 6. Having vertices on the boundary is the worst- 
case scenario, so this implies that all geodesic triangles in §)? are 5-thin. 


The following important “shadowing property” tells us that “quasi-geodesics” 
are always close to geodesics. Unfortunately, its proof is somewhat lengthy, 
so we omit it. (See Exercises 1 and 2 for some weaker results that are easier.) 
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(10.2.3) Theorem. Suppose 
e X is a C-coarsely geodesic 6-hyperbolic metric space, and 


e {X0,%1,..-,Un} is finite sequence of points in X, such that, for all i,j 
we have: 


—-C < d(x, 2;) <Cli-gl+C. 


Then there exists C'’ > 0 (depending only on C and 6), such that the set 
{£0,%1,---,Xn} is contained in the C'-neighborhood of every C-coarse geodesic 
from xo to x4. 


ji = J 
C 


This implies that being Gromov hyperbolic is invariant under quasi- 
isometry: 


(10.2.4) Corollary (see Exercise 4). Assume 
e X, and X2 are coarsely geodesic metric spaces, and 
eX) x Xo. 

If X, 1s Gromov hyperbolic, then Xz is Gromov hyperbolic. 


(10.2.5) Corollary. The fundamental group of any compact manifold M of 
strictly negative sectional curvature is Gromov hyperbolic. 


Proof. Since M is compact, the fundamental group 7,(/M/) acts cocompactly 
on the universal cover M of M, so 71(M) ~ M (see Proposition 10.1.7). Now 
apply Theorem 10.2.2 and Corollary 10.2.4. L 


This observation allows us to determine precisely which lattices are Gro- 
mov hyperbolic: 


(10.2.6) Proposition. [ is Gromov hyperbolic if and only if rankgG = 1 
and either G/T is compact, or the unique noncompact simple factor of G is 
isogenous to SL(2, R). 


Sketch of proof. With a bit more theory than has been presented here, it is 
not difficult to show that Gromov hyperbolic groups never contain a subgroup 
isomorphic to Z x Z, so we may assume I is irreducible. 


Case 1. Assume G/T is compact. From Proposition 10.1.7, we know that [ 
is quasi-isometric to the symmetric space G/K associated to G. 
e If rankg G = 1, then G/K has negative sectional curvature, bounded 
away from 0, so it is Gromov hyperbolic. 
e If rankgG > 2, then G/K contains 2-dimensional flats, so it is not 
Gromov hyperbolic. 


Case 2. Assume G/T is not compact. (=) We may assume that G has no 
compact factors. Let U be a maximal unipotent subgroup of [. We know 
that U does not contain a subgroup isomorphic to Z x Z. Also, since G/T is 
not compact, we know U is infinite (see Remark 4.4.5). Therefore, since U is 
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nilpotent (and torsion-free), it is easy to see that U must be cyclic. It can be 
shown that this implies G is isogenous to SL(2, R). 


(<) T is virtually free, so it is Gromov hyperbolic (see Exercise 3). 


Exercises for §10.2. 


ren 


#2. 


#3. 


4A, 


Show that the coarse geodesic between two points in a Gromov hyper- 
bolic space is coarsely unique. More precisely, given C’, show there is 
some C’ > 0, such that if y and 7’ are two C-coarse geodesics with the 
same endpoints, then ¥ is contained in the C’-neighborhood of 9’. 


[Hint: If a and 6 are the two endpoints, consider the (degenerate) triangle 
abb.] 


Show that if X is a C-coarsely geodesic 6-hyperbolic space, and C’ > C, 
then there exists 6’, such that every C’-coarse geodesic from a to 0 is 
in the 6’-neighborhood of every C-coarse geodesic from a to b. (This is 
a generalization of Exercise 1.) 

[Hint: For any point c on a C’-coarse geodesic from a to b, there is a C- 
coarse triangle abc. If c is not in the d-neighborhood of ab, then there exist 


a’,b’ € ab that are distance less than 6 from points a” and b” on Gé and bc, 
respectively, such that d(a’,b’) < C+ 1. Bound d(a’,c) by noting that c is 


on the C’’-coarse geodesic a/c U cb”,] 


Show that free groups are Gromov hyperbolic. 

[Hint: The word metric corresponding to a set of free generators is 0- 
hyperbolic.| 

Prove Corollary 10.2.4. 


[Hint: Use Theorem 10.2.3 to show that coarse triangles in X2 can be ap- 
proximated by coarse triangles in X%.] 


Notes 


Almost all of the material in this chapter can be found in any treatment of 
geometric group theory, such as [5], [7], or (more elementary) [1]. A detailed 
treatment of this and much more is in [2]. 

The Lubotzky-Mozes-Raghunathan Theorem (10.1.10) is proved in [9] (or 
see [8] for an exposition of the special case where [ = SL(n, Z)). 

See [3] for an introduction to the theory of Gromov hyperbolic groups, or 
[4, 6] for much more information. The notion of d-hyperbolic group is credited 
to E. Rips, who also proved some of the basic properties (such as that they 
are finitely presented), but much of the foundational work in the subject was 
done by M. Gromov [6]. 

Proposition 10.2.6 is well known. 
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Chapter 11 


Unitary Representations 


Unitary representations are of the utmost importance in the study of Lie 
groups. For our purposes, one of the main applications is the proof of the 
Moore Ergodicity Theorem (4.10.3) in Section 11.2, but they are also the 
foundation of the definition (and study) of Kazhdan’s Property (T’) in Chap- 
ter 13. 


§11.1. Definitions 


(11.1.1) Definition. Assume H is a Hilbert space, with inner product ( | ), 
and H is a Lie group. 


1) U(H) is the group of unitary operators on H. 


2) A unitary representation of the Lie group H on the Hilbert space H 
is a homomorphism 7: H — U(H), such that the map h +> r(h)y is 
continuous, for each y € H. (If we wish to spell out that a unitary 
representation is on a particular Hilbert space H, we may refer it as 
(7,H), rather than merely 7.) 


3) The dimension of a unitary representation (7,#) is the dimension of 
the Hilbert space H. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: none. 
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4) Suppose (71, H1) and (72, H2) are unitary representations of H. 
(a) The direct sum of the representations 7 and 72 is the unitary 
representation 7, © 72 of H on H, ® Hz2 that is defined by 


(7 ® 72)(h) (G1, 92) = (mi (h)¥1, T2(h) v2), 
for h € H and y; € Hj. 
(b) 7 and 72 are isomorphic if there is a Hilbert-space isomorphism 


T: Hy = Hy that intertwines the two representations. This 
means T'(71(h)y) = 72(h) T(y), for allh € H and y € Hy. 


(11.1.2) Example. Every group H has a trivial representation, denoted 
by 1 (or 1, if it will avoid confusion). It is a unitary representation on the 
1-dimensional Hilbert space C, and is defined by 1(h)y = ¢ for all h € H and 
pec. 


Here is a more interesting example: 


(11.1.3) Example. Suppose 
e H isa Lie group, 
e H acts continuously on a locally compact, metrizable space X, and 
e is an H-invariant Radon measure on X. 


Then there is a unitary representation of H on £°(X, 11), defined by 


(m(h)y) (x) = p(h-*a) 
(cf. Exercise 4.10#11). For the action of H on itself by translations (on 
the left), the resulting representation 7eg of H on £°(H) is called the (left) 
regular representation of H. 


(11.1.4) Definition. Suppose z is a unitary representation of H on H, H' is 
a subgroup of H, and K is a closed subspace of H. 

1) K is H’-invariant if m(h’)K = K, for all h’ € H’. (If the repre- 
sentation 7 is not clear from the context, we may also say that K is 
m(H")-invariant. ) 

2) For the special case where H’ = H, an H-invariant subspace is simply 
said to be invariant, and the representation of H on any such subspace 
is called a subrepresentation of 7. More precisely, if K is H-invariant 
(and closed), then the corresponding subrepresentation is the unitary 
representation 7 of H on K, defined by mx (h)y = r(h)y, for all h € H 
and ye K. 


(11.1.5) Lemma (see Exercise 2). If (7,H) is a unitary representation of H, 
and K is a closed, H-invariant subspace of H, thena = me OTK. 


The above lemma shows that any invariant subspace leads to a decom- 
position of the representation into a direct sum of subrepresentations. This 
suggests that the fundamental building blocks are the representations that do 
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not have any (interesting) subrepresentations. Such representations are called 
“arreducible:” 


(11.1.6) Definitions. Let H be a Lie group. 


1) 


2) 


A unitary representation (7,#H) of H is irreducible if it has no non- 
trivial, proper, closed, invariant subspaces. That is, the only closed, 
H-invariant subspaces of H are {0} and H. 

The set of all irreducible representations of H (up to isomorphism) is 
called the unitary dual of H, and is denoted H. 


(11.1.7) Warnings. 


1) 


Unfortunately, it is usually not the case that every unitary represen- 
tation of H is a direct sum of irreducible representations. (This is a 
generalization of the fact that if U is a unitary operator on H, then 
H may not be a direct sum of eigenspaces of U.) However, it will be 
explained in Section 11.6 that every unitary representation is a “di- 
rect integral” of irreducible representations. In the special case where 
H =Z, this is a restatement of the Spectral Theorem for unitary oper- 
ators (cf. Proposition B7.12). 


Although the unitary dual H has a fairly natural topology, it can be 
quite bad. In particular, the topology may not be Hausdorff. Indeed, in 
some cases, the topology is so bad that there does not exist an injective, 
Borel measurable function from H to R. Fortunately, though, the worst 
problems do not arise for semisimple Lie groups: the unitary dual is 
always “tame” (measurably, at least) in this case. 


Exercises for §11.1. 


#1. 


#2. 


#3. 


4A, 


Suppose 7 is a unitary representation of H on H, and define a map 
€:HxH-H by E(h,v) = 7(h)v. Show that € is continuous. 
[Hint: Use the fact that 7(H) consists of unitary operators.] 


Prove Lemma 11.1.5. 

[Hint: If K is invariant, then K* is also invariant. Define T: K @K+ > H 
by Tig, v) = 9+] 

(Schur’s Lemma) Suppose (7,#H) is an irreducible unitary representa- 
tion of H, and T is a bounded operator on H that commutes with every 
element of 7(H). Show there exists A € C, such that Ty = Ay, for 
every yp € H. 

[Hint: Assume T is normal, by considering T + T* and T — T*, and apply 
the Spectral Theorem (B7.12).] 

Suppose (7, H) is an irreducible unitary representation of H, and ( | )’ is 
another H-invariant inner product on H that defines the same topology 


on H. as the original inner product ( | ). Show there exists c € Rt, such 
fiat <= eC | ys 
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[Hint: Each inner product provides an isomorphism of H with H*. Exercise 3 
implies they are the same, up to a scalar multiple.] 


#5. (optional) In the situation of Example 11.1.3, a weaker assumption on ju 
suffices to define a unitary representation on £°(X, 1). Namely, instead 
of assuming that py is invariant, it suffices to assume only that the class 
of ys is invariant. This means, for every measurable subset A, and all 
h © H, we have p(A) = 0 & w(hA) = 0. Then, for each h € H, the 
Radon-Nikodym Theorem (B6.13) provides a function D,: X > R*, 
such that hy = Dy. Show that the formula 

(m(h)y) = VDn(x) o(h-*a) 
defines a unitary representation of H on £°X, 1). 


§11.2. Proof of the Moore Ergodicity Theorem 
Recall the following result that was proved only in a special case: 


(11.2.1) Theorem (Moore Ergodicity Theorem (4.10.3)). Suppose 
e G is connected and simple, 
e H is a closed, noncompact subgroup of G, 
e A is a discrete subgroup of G, and 
e ¢ is an H-invariant £?-function on G/A (with 1 < p< oo). 
Then @ is constant (a.e.). 
This is an easy consequence of the following result in representation theory 
(see Exercise 1). 
(11.2.2) Theorem (Decay of matrix coefficients). If 
e G is simple, 
ez is a unitary representation of G on a Hilbert space H, such that no 
nonzero vector is fired by 7(G), and 


e {g;} is a sequence of elements of G, such that ||g;|| + 00, 
then (1(g;)¢ | 2) — 0, for every 6, € H. 


Proof. Assume, for simplicity, that 

G =SL(2,R). 
(A reader familiar with the theory of real roots and Weyl chambers should 
have little difficulty in extending this proof to the general case; cf. Exercise 6.) 
Let 

A= \* z| CG. 


Further assume, for simplicity, that 
{93} CA. 


11.2. PROOF OF THE MOORE ERGODICITY THEOREM 217 


(It is not difficult to eliminate this hypothesis; see Exercise 5.) By passing to 
a subsequence, we may assume 7(g;) converges weakly, to some operator E; 
that is, 


(7(9;)0 |b) > (Eo |p) for every $y €H 
(see Exercise 2). Let 
U={veG| 9; V9; —e} (11.2.3) 
and 
U- ={ueG| 9549; * eh}. (11.2.4) 
For u € U~ and ¢,w € H, we have 
(En(u)@ | b) = lim(n(gju)¢ | %) 
= lim(r(g;ug; *) r(g;)¢ | v) 
= lim(z(g;)¢ | v) (see Exercise 3) 
= (E¢| ¥), 
so Ex(u) = E. Therefore, letting 
HY ={¢EH| ac(uJo=¢ forallucU } 
be the space of U~-invariant vectors in H, we have 
(HY )+ CherE (11.2.5) 
(see Exercise 4). Similarly, since 
(E* | ) = (¢ | Ed) = lim(4 | 2(95)b) = lim(a (95 ")4 | ¥), 


the same argument, with E* in the place of E and ace in the place of g;, 
shows that 


(HY)+ C ker E*. 
We also have 
(x(95)¢ | t(9x)) = (r(g_95)¢ | ) ((g,‘) is unitary) 
= (r(959%')¢ | ¢) (A is abelian) 
= (n(9,')o | r(95')¢). 


Applying lim;_,.. limz-+.. to both sides yields ||E¢||? = ||E*¢||?, and this 
implies ker F = ker E*. Hence, 


ker E = ker FE + ker E* > (HY )+4 (H¥)+ 
= (HU nHY)+ = GE 2) 
Now, by passing to a subsequence of {g;}, we may assume (U,U~) = G 


(see Exercise 7). Then H'\%U ) = H© =0, so ker FE > 0+ =H. This implies 
that, for all ¢,~ © H, we have 


lim(7(95)¢ | ¥) = (Eb |) = (0| b) = 0. 
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(11.2.6) Remark. If A is a bounded operator on a Hilbert space H, and ¢, w € 
H, then the inner product (Ad | ~) is called a matriaz coefficient of A. The 
motivation for this terminology is that if A € Matnx,(R), and €1,...,€n is 
the standard basis of # = R”, then (Ae; | €;) is the (i, 7) matrix entry of A. 


The above argument yields the following more general result. 


(11.2.7) Corollary (of proof). Assume 

e G has no compact factors, 

ez is a unitary representation of G on a Hilbert space H, and 

e {gn} > c in G/N, for every proper, normal subgroup N of G. 
Then (gn |) + 0, for every ¢,v € (H%)+. 


This has the following consequence, which implies the Moore Ergodicity 
Theorem (see Exercise 9). 


(11.2.8) Corollary (Mautner phenomenon (see Exercise 8)). Assume 
e G has no compact factors, 
e 7 is a unitary representation of G on a Hilbert space H, and 
e H is a closed subgroup of G. 


Then there is a closed, normal subgroup N of G, containing a cocompact 
subgroup of H, such that every 7(H)-invariant vector in H. is t(N)-invariant. 


(11.2.9) Remark. Theorem 11.2.2 does not give any information about the 
rate at which the function (¢(g)¢ | w) tends to 0 as ||g|| > co. For some 
applications, it is helpful to know that, for many choices of the vectors @ 
and uw, the inner product decays exponentially fast: 

If G, 7, and H are as in Theorem 11.2.2, then there is a dense, linear 
subspace H., of H, such that, for all ¢,~ € Hq, there exist a,b > 1, such 
that 


b 
Ko(ge lv) <7 for all g eG. 
a 


Specifically, if A is a maximal compact subgroup of G, then we may take 
Ho ={¢€EH| the linear span of K¢ is finite-dimensional }. 
(So H is the space of “K-finite” vectors.) 


Exercises for §11.2. 


#1. Show that Theorem 11.2.1 is a corollary of Theorem 11.2.2. 
[Hint: If ¢ is an H-invariant function in £°(G/T), let ¢’ = |d|?/? € £°(G/T). 
Then (¢’, 9;¢’) = (¢’, ¢’) for every g; € H.] 

#2. Let {T;} be a sequence of unitary operators on a Hilbert space H. Show 
there is a subsequence {T;,} of {T;} and a bounded operator E on H, 


I 0o 


such that (Tj,v | w) —> (Ev | w) for all v,w € H. 


#3. 


4A, 


#5. 


#6. 
ng 


#8. 
#9. 


4.10 


#11 
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[Hint: Choose an orthonormal basis {ep}. For each p,q, the sequence 
{(Tjep | eq) is bounded, and therefore has a subsequence that converges to 
some Qp,qg- Cantor diagonalization implies that we may assume, after passing 
to a subsequence, that (Tjep | eg) 4 p,q for all p and q.] 


Suppose 
e 7 is a unitary representation of G on H, 
e {¢,;} is a sequence of unit vectors in H, and 
eu;—-einG. 
Show lim(z(u,;)@; | w) = lim(¢; | #), for all ~ EH. 
[Hint: Move z(u;) to the other side of the inner product. Then use the 
continuity of 7 and the boundedness of {¢; }.] 


Prove (11.2.5). 

[Hint: Let K be the closure of {(u)¢6—@|u€U-,¢€ (H” )*}, and note 

that K C ker E. If p € K+, then 2(u)y — w = 0 for all u € U~ (why?), so 

wEeH” | 

Eliminate the assumption that {g;} C A from the proof of Theo- 

rem 11.2.2. 

[Hint: You may assume the Cartan decomposition, which states that G = 

Kk AK, where K is compact. Hence, g; = cjajcj, with cj,c; € K and a; € A. 

Assume, by passing to a subsequence, that {c;} and {c/,} converge. Then 
lim(m(gj)¢ | b) = lim(m(aj)(m(C')@) | me) *b) = 0 


if c; + cand cj > c’.] 


Prove Theorem 11.2.2 for the special case where G = SL(n, R). 


For G, A, {g;}, U, and U~ as in the proof of Theorem 11.2.2 (with 
{g;} C A), show that if {g;} is replaced by an appropriate subsequence, 
then (U,U~) =G. 


Hint: Arrange that U is 1*) and U7 is |1 , or vice versa. 
1 * 1 


Derive Corollary 11.2.8 from Corollary 11.2.7. 


Derive Theorem 14.2.4 from Corollary 11.2.8. 
[Hint: If f is H-invariant, then (7(h)f | f) = (f | f) for all h € H.] 


Suppose 
e G is connected, with no compact factors, 
e A is a discrete subgroup of G, 
e H is asubgroup of G whose projection to every simple factor of G 
is not precompact, and 
e dis an H-invariant £? function on G/A, with 1 < p< ow. 
Show that ¢ is constant (a.e.). 
Suppose 


e H is a noncompact, closed subgroup of G, 
e I is irreducible, and 
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e ¢: G/T > R is any H-invariant, measurable function. 
Show that ¢ is constant (a.e.). 


[Hint: There is no harm in assuming that ¢ is bounded (why’), so it is in 
£°(G/T) (why?). Apply Corollary 11.2.8.] 


§11.3. Induced representations 


It is obvious that if H is a subgroup of G, then any unitary representation 
of G can be restricted to a unitary representation of H. (That is, if we 
define t|y by t|q(h) = z(h) for h € H, then a|y is a unitary representation 
of H.) What is not so obvious is that, conversely, every unitary representation 
of H can be “induced” to a unitary representation of G. We will need only 
the special case where H = T is a lattice in G (but see Exercise 1 for the 
definition in general). This construction will be a key ingredient of the proof 
in Section 13.4 that [ often has Kazhdan’s property (7). 


(11.3.1) Definition (Induced representation). Suppose 7 is a unitary repre- 
sentation of [ on H. 
1) A measurable function y: G — H is said to be (essentially) right T’- 


equivariant if, for each y € T, we have 


o(97*) = m(y) 9(g) for ae. g €G. 
2) We use LAG; H) to denote the space of right [-equivariant measurable 
functions from G to H, where two functions are identified if they agree 
almost everywhere. 


3) For y € £(G;H), we have ||y(g7)\lx = Ilv(g) lla for every 7 € T and 
a.e. g € G (see Exercise 2). Hence, ||y(g)||z, is a well-defined function 
on G/T (a.e.), so we may define the £?-norm of y by 


1/2 
olla = ( A loli ts) | 


4) We use £2(G;H) to denote the subspace of £,(G;H) consisting of the 
functions with finite £?-norm. It is a Hilbert space (see Exercise 3). 


5) Note that G acts by unitary operators on £2(G;H), via 
(g-y)(z)=9(g-'2) forgeG, pe LY{G;H), andzxeG 
(see Exercise 4). This unitary representation of G is called the repre- 
sentation induced from 7, and it is denoted Indf(r). 


Exercises for §11.3. 


##1. (optional) Suppose (7,#) is a unitary representation of a closed sub- 
group H of G. Define IndG(), without assuming that there is a G- 
invariant measure on G/H. 


11.4. REPRESENTATIONS OF COMPACT GROUPS 221 


[Hint: Since G/H is a C® manifold (see Proposition A6.2), we may use 
a nowhere-vanishing differential form to choose a measure 4 on G/H, such 
that f. is in the class of jz, for every diffeomorphism f of G/H. A unitary 
representation of G on £°(G/H, 11) can be defined by using Radon-Nikodym 
derivatives, as in Exercise 11.1445, and the same idea yields a unitary repre- 
sentation on a space of H-equivariant functions.| 


#2. Let » € LAG;H) and y ET, where z is a unitary representation of T 
on H. Show ||p(g7)||x = ||(g)|lx, for ae. g € G. 


#3. Show £2(G;H) is a Hilbert space (with the given norm, and assuming 
that two functions represent the same element of the space if and only 
if they are equal a.e.). 


#4. Show that the formula in Definition 11.3.1(5) defines a unitary repre- 
sentation of G on L2(G;H). 


#5. Show that Ind€(1) is (isomorphic to) the usual representation of G on 
L°G/T) (by left translation). 


#6. Show that if Ind@(z) is irreducible, then 7 is irreducible. 


#7. Show that the converse of Exercise 6 is false. 
[Hint: Is the representation of G on £7°(G/T) irreducible?] 


§11.4. Representations of compact groups 


(11.4.1) Example. Consider the circle R/Z. For each n € Z, define 
&: RIZ SC bye, 4) =e". 
The theory of Fourier Series tells us that {e,} is an orthonormal basis of 
£>(R/Z), which means we have the direct-sum decomposition 
£(R/Z) = PBCen. 
neZ 

Furthermore, it is easy to verify that each subspace Ce, is an invariant sub- 
space for the regular representation (see Exercise 1), and, being 1-dimensional, 
is obviously irreducible. Hence, we have a decomposition of the regular rep- 
resentation into a direct sum of irreducible representations. In addition, it is 
not difficult to see that every irreducible representation of T occurs exactly 
once in this representation. 

More generally, it is not difficult to show that every unitary representation 
of T is a direct sum of 1-dimensional representations. 


The following theorem generalizes this to any compact group. How- 
ever, for nonabelian groups, the irreducible representations cannot all be 1- 
dimensional (see Exercise 11.6#1). 


(11.4.2) Theorem (Peter-Weyl Theorem). Assume H is compact. Then: 
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1) Every unitary representation of H is (isomorphic to) a direct sum of 
irreducible representations. 


2) Every irreducible representation of H is finite-dimensional. 
3) A is countable. 


4) For the particular case of the regular representation (rea) ); we 
have 


Mt peg CS (dim H) - 7, 
(x, H)EH 


where k- a denotes the direct sum 7 @®--- Oa of k copies of a. That 
is, the “multiplicity” of each irreducible representation is equal to its 
dimension. 


Proof. In order to establish both (1) and (2) simultaneously, it suffices to 
show that if (7,1) is any unitary representation of H, then H is a direct sum 
of finite-dimensional, invariant subspaces. Zorn’s Lemma (B5.3) provides a 
subspace M of H that is maximal among those that are a direct sum of finite- 
dimensional, invariant subspaces. By passing to the orthogonal complement 
of M, we may assume that H has no nonzero, finite-dimensional, invariant 
subspaces. 
Let 


e P be the orthogonal projection onto some nonzero subspace of H that 
is finite-dimensional, 


e yt be the Haar measure on H, and 
e P= Jag t(h) P(h") du(h). 


Note that, since P commutes with 7(H) (see Exercise 3), every eigenspace 
of P is H-invariant (see Exercise 6). 

Since P is self-adjoint and each 7(h) is unitary (so 7(h~!) = r(h)*), it is 
not difficult to see that P is self-adjoint. It is also compact (see Exercise 4) 
and nonzero (see Exercise 5). Therefore, the Spectral Theorem (B7.14) im- 
plies that P has at least one eigenspace FE that is finite-dimensional. By 
contradicting the fact that H has no nonzero, finite-dimensional, invariant 
subspaces, this completes the proof of (1) and (2). 


Note that (3) is an immediate consequence of (4), since Hilbert spaces are 
assumed to be separable (see Assumption B7.7), and therefore cannot have 
uncountably many terms in a direct sum. 


We now give the main idea in the proof of (4). Given an irreducible 
representation (7,C*), we will not calculate the exact multiplicity of 7, but 
only indicate how to obtain the correct lower bound by using properties of 
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matrix coefficients. Write m(x) = | fi a(@)]. aes 

[ (Treg (h) fis) (@)] = [Fig(h'@)] = 

= m(ho") m(a) = - vat )]. 
Now, for 1 <j < k, define T;: C* > £7(H), by 
T;(a1,---,@%) = arfig + Gafog +--+ + anfe,z- 
Equating the jth columns of the two ends of (11.4.3) tells us that 
T(m(h)v) = treg(h) T(v), 

so T’ *,(C*) is an invariant subspace, and the corresponding subrepresentation 
is isomorphic to 7. Therefore, there are (at least) k different copies of 7 in Treg 


(one for each value of j). Since, by definition, k = dim 7, this establishes the 
correct lower bound for the multiplicity of 7. 


(11.4.3) 


As an illustrative, simple case of the main results in Sections 11.5 and 11.6, 
we present two different reformulations of the Peter-Weyl Theorem for the 
special case of abelian groups, after some preliminaries. 


(11.4.4) Definition. Let A be an abelian Lie group. 

1) A character of A is a continuous homomorphism X: A — T, where 
P= 7 eC] eta ih 

2) The set of all characters of A is called the Pontryagin dual of A, and 
is denoted A*. It is an abelian group under the operation of pointwise 
multiplication. (That is, the product X1Xq is defined by (X1X2)(a) = 
X1(a) Xo(a), for X1,X2 € A* and a € A.) Furthermore, if A/A° is 
finitely generated, then A* is a Lie group (with the topology of uniform 
convergence on compact sets). 


(11.4.5) Observation. Jf A is any abelian Lie group (compact or not), then 
every irreducible representation (7,H) of A is 1-dimensional (see Exercise 8). 
Therefore, the unitary dual A can be identified with the Pontryagin dual A* 
(see Exercise 9). 


Hence, for the special case where H = A is abelian, we have the following 
reformulation of the Peter-Weyl Theorem: 


(11.4.6) Corollary (see Exercise 10). Assume (1,H) is a unitary represen- 
tation of a compact, abelian Lie group A. For each X € A*, let 
eHy ={~EH| d(a)p =X(a)¢, for allac A}, and 
e Py: H- Hy be the orthogonal projection. 
Then H = Dyes Hy, 80, for alla € A, we have 
= >> X(a)P 
XeA* 


Here is another way of saying the same thing: 
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(11.4.7) Corollary (see Exercise 11). Assume (7,H) is a unitary represen- 
tation of a compact, abelian Lie group A. Then there exist 
e a Radon measure on a locally compact metric space Y , and 
e a Borel measurable function X: Y + A*: y+ Xy (where the countable 
set A* is given the discrete topology) 
such that m is isomorphic to the the unitary representation py of A on LAY) 
that is defined by 


(py(a)y)(y) = Xy(a) yly) forae A, pe LY"), andy EY. 


An analogue of this result for semisimple groups will be stated in Sec- 
tion 11.6, after we define the “direct integral” of a family of representations. 


Exercises for §11.4. 


#1. In the notation of Example 11.4.1, show tMreg(h)en = e-2nthe, . for all 
he R/Z. 


#2. Suppose (7,#) is a unitary representation of a compact group H, let 
y,w € H, and define f: H > C by f(h) = (r(h)y | wv). Show f € 
LAH). 


[Hint: It is a bounded function on a compact set.| 


#3. Suppose (7, ) is a unitary representation of a compact group H. Show 
that if T is any bounded operator on H, then 


T= f may Doth Tint) 


is an operator that commutes with every element of 7(#). 


[Hint: The invariance of Haar measure implies 7(g)Ta(g_') = T.| 


#4. Show that the operator P in the proof of Theorem 11.4.2 is compact. 


[Hint: Apply Proposition B7.11, by noting that any integral can be approx- 
imated by a finite sum, and the finite sum is an operator whose range is 
finite-dimensionall.] 


#5. Show that the operator P in the proof of Theorem 11.4.2 is nonzero. 
[Hint: Choose some nonzero y € H, such that Py = y. Then (Py | v) > 0, 
since (Pw | ~) > 0 for all ~ € H.| 

#6. Suppose (7,#) is a unitary representation of a Lie group H, T is a 
bounded operator on H, A € C, and y € H. Show that if T’ commutes 
with every element of 7(H), and T(y) = Ay, then T(x(h)y) = Ax(h)g, 
for every h € H. 


#7. Assume H is compact. Show that H is finite if and only if it has 
only finitely many different irreducible unitary representations (up to 
isomorphism). 

[Hint: You may assume Theorem 11.4.2.| 
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#8. Show that every irreducible representation of any abelian Lie group is 
1-dimensional. 


[Hint: If x(a) is not a scalar, for some a, then the Spectral Theorem (B7.12) 
yields an invariant subspace.| 


#9. Let H be a Lie group. Show there is a natural bijection between the 
set of 1-dimensional unitary representations (modulo isomorphism) and 
the set of continuous homomorphisms from H to T. 
[Hint: Any 1-dimensional unitary representation is isomorphic to a represen- 
tation on C.| 


#10. Derive Corollary 11.4.6 from Theorem 11.4.2. 


#11. Prove Corollary 11.4.7. 


[Hint: If Hy # {0}, then Hy is isomorphic to some £L(Yy, wy). Let Y = 
Uy ¥x-] 


§11.5. Unitary representations of R” 


Any character X of R” is of the form 


X(a) = e277 (a) for some (unique) y € R” 


(see Exercise 1). Therefore, the Pontryagin dual (R”)* (or, equivalently, the 
unitary dual R”) can be identified with R” (by matching X with the corre- 
sponding vector y). In particular, unlike in Theorem 11.4.2, the unitary dual 
is uncountable. 

Unfortunately, however, not every representation of R” is a direct sum 
of irreducibles. For example, the regular representation 7Tyeg of R” on LR") 
has no 1-dimensional, invariant subspaces (see Exercise 2), so it does not even 
contain a single irreducible representation and is therefore not a sum of them. 
Indeed, Fourier Analysis tells us that a function in £°(R”) is not a sum of 
exponentials, but an integral: 


roe | Aly) 2"! dy, 


where ¢ is the Fourier transform of y. Now, for each Borel subset FE of R”, 
let 


He =—{f € £°(R”) | f(y) =0 for ae. y ¢ E}. (Th5.1) 

Then it is not difficult to show that Hz is a closed, 7yeg-invariant subspace 
(see Exercise 3). 

Now, let P(E): H — Hg be the orthogonal projection. Then we can 

think of P as a projection-valued measure on R” (or on (IR”)*), and, for all 
a € R”, we have 


Treg(@) = i: sore dP(y) = / sn dE(X). 
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If we let 7 = Treg, this is a perfect analogue of the conclusion of Corol- 
lary 11.4.6, but with the sum replaced by an integral. 

A version of the Spectral Theorem tells us that this generalizes in a nat- 
ural way to all unitary representations of R”, or, in fact, of any abelian Lie 


group: 


(11.5.2) Proposition. Suppose 7 is a unitary representation of an abelian 
Lie group A on H. Then there is a (unique) projection-valued measure P 
on A*, such that 


m(a) = X(a)dP(X) for alla€ A. 
A* 


This can be reformulated as a generalization of Corollary 11.4.7: 


(11.5.3) Corollary. Let (7,H) be a unitary representation of an abelian Lie 
group A. Then there exist 


e a probability measure 4 on a locally compact metric space Y, and 
e a Borel measurable function X: Y + A*: yt? Xy, 


such that 7 is isomorphic to the unitary representation py of A on ey) 
that is defined by 


(py(a)y)(y) = Xy(a) vly) forae A, pe LY"), andy e€Y. 


Proof. Let P be the projection-valued measure given by Proposition 11.5.2. 
A closed subspace H’ of H is said to be cyclic for P if there exists wy € 
H’', such that the span of { P(E) | E Cc A*} is a dense subspace of H’. 
It is not difficult to see that H is an orthogonal direct sum of countably 
many cyclic subspaces (see Exercise 4). Therefore, we may assume H is 
cyclic (see Exercise 5) (and nonzero). So we may fix some unit vector ~ that 
generates a dense subspace of H. 
Define a probability measure ps on A* by 


u(E) = (P(E)e |p) = (P(E)p | P(E )Y), 
and let Id be the identity map on A*. 


For the characteristic function fg of each Borel subset E of A*, define 
®(fe) = P(E)y. Then (®(fz,) | ®(fm,)) = (fe, | f,), by the definition 
of 1, so ® extends to a norm-preserving linear map ©’ from £°(A*, 4) to H. 
Since w is a cyclic vector for H, we see that ®’ is surjective, so it is an 
isomorphism of Hilbert spaces. Indeed, ®’ is an isomorphism from pjg to 7 
(see Exercise 6). Oo 


Exercises for §11.5. 


#1. Show that every character X of R” is of the form X(a) = e?7(*® for 
some t € R”. 
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[Hint: Since R” is simply connected, any continuous homomorphism from R” 
to T can be lifted to a homomorphism into the universal cover, which is R. 
Apply Exercise A6#1.] 


#2. Let H be anoncompact Lie group. Show that the regular representation 
of H has no 1-dimensional, invariant subspaces. 


[Hint: If y is in a 1-dimensional, invariant subspace of £°(H), then |y] is 
constant (a.e.).| 


#3. For every measurable subset EF of R”, show that the subspace Hg de- 
fined in (11.5.1) is closed, and is invariant under 7eg(R”). 


(Hint: It is clear that { f | f € Hw} is closed. For invariance, note that the 
Fourier transform of Treg(a)f is e 77" f(y).] 


#4. Given a projection-valued measure P on a Hilbert space H, show that 
H is the orthogonal direct sum of countably many cyclic subspaces. 
[Hint: Every vector in H is contained in a cyclic subspace, the orthogonal 
complement of a P(£)-invariant subspace is P(£)-invariant, and all Hilbert 
spaces are assumed to be separable.| 

#5. In the notation of Corollary 11.5.3, suppose py, is the representation on 
L°(Y;, i) corresponding to some X;: Y; > A*. Show QP, PX, = PX: 
for some Y, ps, and X. 

[Hint: Let (Y, 4) be the disjoint union of (copies of) (Yi, 1:).] 
#6. In the notation of the proof of Corollary 11.5.3, show that ®’ is an 


isomorphism from pjq to 7. 


[Hint: Given a € A and E C A*, write E as the disjoint union of small sets 
E,,...,En (so X + X(a) is almost constant on each F;). Then 


®' (pra(a) fa) © © (D0; Xi(a) fe;) = 0, Xi(a) P(E) b & w(P(E)Y) = 2(8'(fe)), 
for any X; € E;.| 


§11.6. Direct integrals of representations 


Before we define the direct integral of a collection of unitary representations, 
we first discuss the simpler case of a direct sum of a sequence {(a, Hn) }°21 
of unitary representations. 


(11.6.1) Definition. If {H,,}°2, is a sequence of Hilbert spaces, then the 
direct sum @Q>~_, Hn consists of all sequences {pp }°2,, such that 


© Yn © Hy for each n, and 


© Yn=illenll? < o0. 


This is a Hilbert space, under the inner product 
CO 


({pn} | {wn}) = So (Pn | Dyn) 


n=1 


It contains a copy of H,, for each n, such that H» L Hn, for m # n. 
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Suppose, now, that all of the Hilbert spaces in the sequence are the same; 
say, Hn =H, for all n. Then QBr~_, Hn is equal to the set of square-integrable 
functions from Zt to H, which can be denoted £°(Z*t;H). In this notation, 
the direct sum of unitary representations is easy to describe: 


(11.6.2) Definition. If { 7,,}°@, is a sequence of unitary representations of H 
on a fixed Hilbert space H, then Q>~_, 7, is the unitary representation 7 on 
L°Z*;H) that is defined by 


(x(h)p)(n) =mn(h) y(n) forhe H, py € £{Z*;H), and ne Z?*. 


This description of the direct sum naturally generalizes to a definition of 
the direct integral of representations: 


(11.6.3) Definition. Suppose 
e H is a Hilbert space, 


e {7,}rex is a measurable family of unitary representations of H on H, 
which means: 
o X is a locally compact metric space, 
© Tz is a unitary representation of H on H, for each x € X, and 
o for each fixed y,a € H, the expression (7,(h)y | w) is a Borel 
measurable function on X x H, 
and 


e j is a Radon measure on X. 


Then [, 72 du(x) is the unitary representation 7 of H on £°(X, w;H) that is 
defined by 


(r(h)y)(z) =m2(h) y(z) forhe H, py € £1 X,u;H), and ce X. 
This is called the direct integral of the family of representations {7,}. 
The above definition is limited by requiring all of the representations to be 
on the same Hilbert space. The construction can be generalized to eliminate 
this assumption (see Remark 11.6.5), but there is often no need: 
(11.6.4) Theorem. Assume 
e 7 is a unitary representation of G, 
e G is connected, and has no compact factors, and 
e no nonzero vector is fixed by every element of 7(G). 
Then there exist H, {tz}cex, and ps, as in Definition 11.6.3, such that 
1) 9S [pre dpa) ,cand 
2) mz is irreducible for every x € X. 
(11.6.5) Remark. Up to isomorphism, there are only countably many differ- 
ent Hilbert spaces (since any two Hilbert spaces of the same dimension are 


isomorphic). It is therefore not difficult to generalize Definition 11.6.3 to deal 
with a family of representations in which the Hilbert space varies with «x. 
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Such a generalization allows every unitary representation of any Lie group to 
be written as a direct integral of representations that are irreducible. 

Here is one way. Let us say that {(72,H2z}2ex is a measurable family of 
unitary representations of H if: 


e X = U~, is the union of countably many locally compact metric 
spaces Xp, 


e for each n, there is Hilbert space H,,, such that H, = H,, for all x € X,, 
e 7, is a unitary representation of H on H,, for each x € X, 


e for each n and each y,7 € Hn, the expression (7z(h)y | w) is a Borel 
measurable function on X, x H, and 


e 4 is a Radon measure on X. 


Given such a family of representations, we define 
CO 


i Rape) = @ | Ty d(x). 

x Peace on 

With this, more general, notion of direct integral, it can be proved that every 
unitary representation of any Lie group is isomorphic to a direct integral of a 
measurable family of irreducible unitary representations. 


Exercises for §11.6. 


#1. Let H be a Lie group. Show that if the regular representation of H is 
a direct integral of 1-dimensional representations, then H is abelian. 


Notes 


There are many books on the theory of unitary representations, including 
the classics of Mackey [7, 8]. Several books, such as [6], specifically focus on 
the representations of semisimple Lie groups. 

The Moore Ergodicity Theorem (11.2.1) is due to C.C. Moore [10]. 

Corollary 11.2.7 is due to R. Howe and C.C. Moore [5, Thm. 5.1] and 
(independently) R.J. Zimmer [12, Thm. 5.2]. The elementary proof we give 
here was found by R. Ellis and M. Nerurkar [2]. Other proofs are in [9, §2.3, 
pp. 85-92] and [13, §2.4, pp. 28-31]. 

A more precise form of the quantitative estimate in Remark 11.2.9 can 
be found in [4, Cor. 7.2]. (As stated there, the result requires the matrix 
coefficient (7(g)¢ | w) to be an £? function of g, for ¢, 7 in a dense subspace 
of H, and for some p < ov, but it was proved in [1, Thm. 2.4.2] that this 
integrability hypothesis always holds.) 

Theorem 11.4.2 is proved in [11, Chap. 3] 

See [3, Chap. 2] for a nice proof of Proposition 11.5.2. (Although most 
of the proof is written for n = 1, it is mentioned on p. 31 that the argument 
works in general.) 
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See [7, Thm. 2.9, p. 108] for a proof of Remark 11.6.5’s statement that 
every unitary representation is a direct integral of irreducibles. (This is a 
generalization of Theorem 11.6.4.) 

Regarding Warning 11.1.7(2), groups for which the set of irreducible uni- 
tary representations admits an injective Borel map to [0, 1] are called “Type I” 
(and the others are “Type II”). See [7, §2.3, pp. 77-85] for some discussion of 
this. 
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Chapter 12 


Amenable Groups 


The classical Kakutani-Markov Fixed Point Theorem (12.2.3) implies that 
any abelian group of continuous linear operators has a fixed point in any 
compact, convex, invariant set. This theorem can be extended to some non- 
abelian groups; the groups that satisfy such a fixed-point property are said to 
be “amenable? and they have quite a number of interesting features. Many 
important subgroups of G are amenable, so the theory is directly relevant to 
the study of arithmetic groups, even though we will see that G and [ are usu- 
ally not amenable. In particular, the theory yields an important equivariant 
map that will be constructed in Section 12.6. 


§12.1. Definition of amenability 


(12.1.1) Assumption. Throughout this chapter, H denotes a Lie group. The 
ideas here are important even in the special case where H is discrete. 


(12.1.2) Definition. Suppose H acts continuously (by linear maps) on a lo- 
cally convex topological vector space Y. Every H-invariant, compact, convex 
subset of Y is called a compact, convex H-space. 


(12.1.3) Definition. H is amenable if and only if H has a fixed point in 
every nonempty, compact, convex H-space. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: none. 
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This is just one of many different equivalent definitions of amenability. 
(A few others are discussed in Section 12.3.) The equivalence of these diverse 
definitions is a testament to the fact that this notion is very fundamental. 


(12.1.4) Remarks. 


1) 


2) 


All locally convex topological vector spaces are assumed to be Haus- 
dorff. 


In most applications, the locally convex space Y is the dual of a sep- 
arable Banach space, with the weak* topology (see Definition B7.3). 
In this situation, every compact, convex subset C’' is second countable, 
and is therefore metrizable (see Remark 12.3.4). With these thoughts 
in mind, we feel free to assume metrizability when it eliminates techni- 
cal difficulties in our proofs. In fact, we could restrict to these spaces 
in our definition of amenability, because it turns out that this modified 
definition results in exactly the same class of groups (if we only consider 
groups that are second countable) (see Exercise 12.3417). 


The choice of the term “amenable” seems to have been motivated by 
two considerations: 

(a) The word “amenable” can be pronounced “a-MEAN-able? and we 
will see in Section 12.3 that a group is amenable if and only if it 
admits certain types of means. 

(b) One definition of “amenable” from the Oxford American Dictio- 
nary is “capable of being acted on a particular way’ In other 
words, in colloquial English, something is “amenable” if it is easy 
to work with. Classical analysis has averaging theorems and other 
techniques that were developed for the study of functions on the 
group R”. Many of these methods can be generalized to all 
amenable groups, so amenable groups are easy to work with. 


Exercises for §12.1. 


ei, 


5: 


#3. 


Show that every finite group is amenable. 


[Hint: For some co € C, let c = ZH heH hcp. Then c € C and ¢ is fixed 
by H.| 


Show that quotients of amenable groups are amenable. That is, if H 
is amenable, and N is any closed, normal subgroup of H, then H/N is 
amenable. 


Suppose Hj, is amenable, and there is a continuous homomorphism 
y: H, — H with dense image. Show H is amenable. 
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§12.2. Examples of amenable groups 


In this section, we will see that: 
e abelian groups are amenable (see 12.2.3), 
e compact groups are amenable (see 12.2.4), 


e solvable groups are amenable (see 12.2.7), because the class of amenable 
groups is closed under extensions (see 12.2.6), and 


e closed subgroups of amenable groups are amenable (see 12.2.8). 


On the other hand, however, it is important to realize that not all groups are 
amenable. In particular, we will see in Section 12.4 that: 


e nonabelian free groups are not amenable, and 


e SL(2,R) is not amenable. 


We begin by showing that Z is amenable: 
(12.2.1) Proposition. Cyclic groups are amenable. 


Proof. Assume H = (T) is cyclic. Given a nonempty, compact, convex 
H-space C’, choose some cg € C’. For n € N, let 


1 

— Te), 12.0.2 

on = a YTHS) (12.2.2) 
k=0 

Since C' is compact, the sequence {c,} must have an accumulation point 

c € C. It is not difficult to see that c is fixed by T (see Exercise 1). Since T’ 


generates H, this means that c is a fixed point for H. 


(12.2.3) Corollary (Kakutani-Markov Fixed Point Theorem). Every abelian 
group 1s amenable. 


Proof. Let us assume H = (g,h) is a 2-generated abelian group. (See Exer- 
cise 5 for the general case.) If C' is any nonempty, compact, convex H-space, 
then Proposition 12.2.1 implies that the set CY of fixed points of g is nonempty. 
It is easy to see that C9 is compact and convex (see Exercise 2), and, because 
FH is abelian, that C9 is invariant under h (see Exercise 3). Hence, C9 is a 
nonempty, compact, convex (h)-space. Therefore, Proposition 12.2.1 implies 
that h has a fixed point c in C9. Now c is fixed by g (because it belongs 
to C9), and c is fixed by h (by definition), so c is fixed by (g,h) = H. 


Compact groups are also easy to work with: 
(12.2.4) Proposition. Compact groups are amenable. 


Proof. Assume H is compact, and let 44 be a Haar measure on H. Given 
a nonempty, compact, convex H-space C’, choose some co € C’. Since ps is a 
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probability measure, we may let 


C= ‘| h(co) du(h) € C. (12.2.5) 

H 
(In other words, c is the center of mass of the H-orbit of co.) The H-invariance 
of yw implies that c is a fixed point for H (see Exercise 6). O 


It is easy to show that amenable extensions of amenable groups are 
amenable (see Exercise 7): 


(12.2.6) Proposition. If H has a closed, normal subgroup N, such that N 
and H/N are amenable, then H is amenable. 


Combining the above results has the following consequences: 


(12.2.7) Corollary. 
1) Every solvable group is amenable. 


2) If H has a solvable, normal subgroup N, such that H/N is compact, 
then H is amenable. 


Proof. Exercises 9 and 10. O 


The converse of Corollary 12.2.7(2) is true for connected groups (see Propo- 
sition 12.4.7). 


(12.2.8) Proposition. Every closed subgroup of any amenable group is amenable. 


Proof. This proof employs a bit of machinery, so we postpone it to Sec- 
tion 12.5. (For discrete groups, the result follows easily from some other 
characterizations of amenability; see Remarks 12.3.13 and 12.3.24 below.) O 


Exercises for §12.2. 


#1. Suppose T is a continuous linear map on a locally convex space Y. 
Show that if c is any accumulation point of the sequence {c,,} defined 
by (12.2.2), then c is T-invariant. 

[Hint: If ||en—cl| is small, then ||T’'(cn)—T(c)|| is small. Show that ||T'\(en)—cn|| 
is small whenever n is large. Conclude that ||T'\(c) —c|| is smaller than every €.] 

#2. Suppose C is a compact, convex H-space. Show that the set C” of 
fixed points of H is compact and convex. 

[Hint: Closed subsets of C' are compact.| 


#3. Suppose H acts on a space C, A is a subgroup of H, and h is an element 
of the centralizer of A. Show that the set C4 of fixed points of A is 
invariant under h. 


#4. Establish Exercise 3 under the weaker assumption that h is an element 
of the normalizer of A, not the centralizer. 


#65. 


#6. 


#7. 


#8. 


#9. 


4£10. 


#11. 


#12. 


4£13. 


4614, 


415. 


4£16. 
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Prove Corollary 12.2.3. 


[Hint: For each h € H, let C” be the set of fixed points of h. The given 
argument implies (by induction) that { C” | h € H } has the finite intersection 
property, so the intersection of these fixed-point sets is nonempty.| 


Show that if ys is the Haar measure on H, and H is compact, then the 
point c defined in (12.2.5) is fixed by H. 


Prove Proposition 12.2.6. 
[Hint: Exercises 2 and 4.] 


Show that H, x H2 is amenable if and only if H; and Hz are both 
amenable. 


Prove Corollary 12.2.7(1). 
[Hint: Proposition 12.2.6.] 


Prove Corollary 12.2.7(2). 
[Hint: Proposition 12.2.6.] 


Suppose H is discrete, and Hj, is a finite-index subgroup. Show H is 
amenable if and only if H, is amenable. 


Show that if A is a lattice in H, and A is amenable, then H is amenable. 
[Hint: Let uw = Siepn hvu dh, where v is a fixed point for A.] 


Assume H is discrete. Show that if every finitely generated subgroup 
of H is amenable, then H is amenable. 


[Hint: For each h € H, let C” be the set of fixed points of h. Then {C” | 
h € H} has the finite intersection property, so (),, C” 4 0.] 


* 
Let P= * * | Cc SL(8,R). Show that P is amenable. 


a Be 
[Hint: P is solvable.| 


Assume there exists a discrete group that is not amenable. Show the 
free group Fy on 2 generators is not amenable. 


[Hint: F, is a subgroup of F2.] 


Assume there exists a Lie group that is not amenable. 
a) Show the free group F2 on 2 generators is not amenable. 
b) Show SL(2,R) is not amenable. 
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§12.3. Other characterizations of amenability 


Here are a few of the many conditions that are equivalent to amenability. The 
necessary definitions are provided in the discussions that follow. 


(12.3.1) Theorem. The following are equivalent: 
1) A is amenable. 
2) H has a fixed point in every nonempty, compact, convex H-space. 


3) If H acts continuously on a compact, metrizable topological space X, 
then there is an H-invariant probability measure on X. 


4) There is a left-invariant mean on the space Cyaa(H) of all real-valued, 
continuous, bounded functions on H. 


5) There is a left-invariant finitely additive probability measure p defined 
on the collection of all Lebesgue measurable subsets of H, such that 
p(E) = 0 for every set E of Haar measure 0. 

6) The left regular representation of H on £°%H) has almost-invariant 
vectors. 


7) There exists a Folner sequence in H. 


The equivalence (1 = 2) is the definition of amenability (12.1.3). Equiv- 
alence of the other characterizations will be proved in the remainder of this 
section. 


§12.3(i). Invariant probability measures. 


(12.3.2) Definitions. Let X be a complete metric space. 
1) A measure yw on X is a probability measure if u(X) = 1. 
2) Prob(X) denotes the space of all probability measures on X. 


Any measure on X is also a measure on the one-point compactification XT 
of X, so, if X is locally compact, then the Riesz Representation Theorem 
(B6.10) tells us that every finite measure on X can be thought of as a linear 
functional on the Banach space C(X7*) of continuous functions on X*. This 
implies that Prob(X) is a subset of the closed unit ball in the dual space 
C(X7)*, and therefore has a weak* topology. If X is compact (so there is no 
need to pass to XT), then the Banach-Alaoglu Theorem (B7.4) tells us that 
Prob(X) is compact (see Exercise 1). 


(12.3.3) Example. If a group H acts continuously on a compact, metrizable 
space X, then Prob(X) is a compact, convex H-space (see Exercise 2). 


(12.3.4) Remark (Urysohn’s Metrization Theorem). Recall that a compact, 
Hausdorff space is metrizable if and only if it is second countable, so requir- 
ing a compact, separable, Hausdorff space to be metrizable is not a strong 
restriction. 
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(12.3.5) Proposition (1 <= 3). H is amenable if and only if for every contin- 
uous action of H on a compact, metrizable space X, there is an H-invariant 
probability measure ps on X. 


Proof. (=) If H acts on X, and X is compact, then Prob(X) is a nonempty, 
compact, convex H-space (see Example 12.3.3). So H has a fixed point in 
Prob(X); this fixed point is the desired H-invariant measure. 


(<) Suppose C is a nonempty, compact, convex H-space. By replacing 
C' with the closure of the convex hull of a single H-orbit, we may assume C’ 
is separable; then C' is metrizable (see Exercise 3). Since H is amenable, this 
implies there is an H-invariant probability measure jz on C’. Since C' is convex 
and compact, the center of mass 


p= f eau(o 


belongs to C' (see Exercise 4). Since y is H-invariant (and the H-action is by 
linear maps), a simple calculation shows that p is H-invariant (see Exercise 6). 


§12.3(ii). Invariant means. 


(12.3.6) Definition. Suppose VY is some linear subspace of £°°(H), and as- 
sume Y contains the constant function 1y that takes the value 1 at every 
point of H. A mean on yY is a linear functional \ on Y, such that 


e \(lz) =1, and 
e is positive, i.e., A(f) > 0 whenever f > 0. 


(12.3.7) Remark. Any mean is a continuous linear functional; indeed, ||A|| = 1 
(see Exercise 8). 


It is easy to construct means: 


(12.3.8) Example. If ¢ is any unit vector in £\(H), and p is the left Haar 
measure on H, then defining 


x)= f fel 


produces a mean (on any subspace of £°°(H) that contains 1;,). Means con- 
structed in this way are (weakly) dense in the set of all means (see Exercise 12). 


Compact groups are the only ones with invariant probability measures, 
but invariant means exist more generally: 


(12.3.9) Proposition (1 = 4). If H is amenable, then there exists a left- 
invariant mean on the space Chaa(H) of bounded, continuous functions on H. 


Proof. The set of means on Chaa(H) is obviously nonempty, convex and 
invariant under left translation (see Exercise 13). Furthermore, it is a weak* 
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closed subset of the unit ball in Cyaq(H)* (see Exercise 14), so it is compact by 
the Banach-Alaoglu Theorem (Proposition B7.4). Therefore, the amenability 
of H implies that some mean is left-invariant. (Actually, there is a slight 
technical problem here if H is not discrete: the action of H on Cyaa(H) may 
not be continuous in the sup-norm topology, because continuous functions do 
not need to be uniformly continuous. ) L] 


(12.3.10) Remark. With a bit more work, it can be shown that if H is 
amenable, then there is a left-invariant mean on £°(H), not just on Chaa(H) 
(see Exercise 14). Therefore, Cyaa(H) can be replaced with £°°(H) in The- 
orem 12.3.1(4). Furthermore, there exists a mean on £°(H) that is bi- 
invariant (both left-invariant and right-invariant) (cf. Exercise 16). 


(12.3.11) Proposition (4 => 3). Suppose H acts continuously on a compact, 
metrizable space X. If there is a left-invariant mean on Cyaa(H), then there 
is an H-invariant probability measure on X. 


Proof. Fix some x € X. Then we have a continuous, H-equivariant linear 
map from C(X) to Chaa(H), defined by 


J (h) = f(ha). 


Therefore, any left-invariant mean on Chaa(H) induces an H-invariant mean 
on C(X) (see Exercise 15). Since X is compact, the Riesz Representation 
Theorem (B6.10) tells us that any continuous, positive linear functional on 
C(X) is a measure; thus, this H-invariant mean \ can be represented by 
an H-invariant measure 4 on X. Since \ is a mean, we have (1) = 1, so 
u(X) = 1, which means that py is a probability measure. O 


§12.3(iii). Invariant finitely additive probability measures. The 
following proposition is based on the observation that, just as probability 
measures on X correspond to elements of the dual of C(X), finitely additive 
probability measures correspond to elements of the dual of £°(X). 


(12.3.12) Proposition (4 = 5). There is a left-invariant mean on £L°°(X) 
if and only if there is a left-invariant finitely additive probability measure p 
defined on the collection of all Lebesgue measurable subsets of H, such that 
p(E) = 0 for every set E of Haar measure 0. 


Proof. (=) Because H is amenable, there exists a left-invariant mean \ on 
£L°(H) (see Remark 12.3.10). For a measurable subset F of H, let p(E) = 
A(X), where X zg is the characteristic function of E. It is easy to verify that 
p has the desired properties (see Exercise 18). 
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(<) We define a mean \ via an approximation by step functions: for 


f € L(A), let 
A(f) = inf S > aip(Ei) is > aXe, ae. 
i=1 i= 1 


Since p is finitely additive, it is straightforward to verify that is a mean 
on £°(H) (see Exercise 19). Since p is bi-invariant, we know that . is also 
bi-invariant. 


(12.3.13) Remark. 


1) Proposition 12.3.12 easily implies that every subgroup of a discrete 
amenable group is amenable (see Exercise 20), establishing Proposi- 
tion 12.2.8 for the case of discrete groups. In fact, it is not very diffi- 
cult to prove the general case of Proposition 12.2.8 similarly (see Exer- 
cise 21). 

2) Because any amenable group H has a bi-invariant mean on £°(H) 
(see Remark 12.3.10), the proof of Proposition 12.3.12(=) shows that 
the finitely additive probability measure p can be taken to be bi- 
invariant. 


§12.3(iv). Almost-invariant vectors. 


(12.3.14) Definition. An action of H on a normed vector space B has almost- 
invariant vectors if, for every compact subset C' of H and every « > 0, there 
is a unit vector v € B, such that 

lcv—v||<e forallceC. (12.3.15) 


(A unit vector satisfying (12.3.15) is said to be (€, C)-invariant. ) 


(12.3.16) Example. Consider the regular representation of H on £°(H). 


1) If H is a compact Lie group, then the constant function lq belongs to 
L°H), so £°(H) has an H-invariant unit vector. 

2) If H=R, then £°H) does not have any (nonzero) H-invariant vectors 
(see Exercise 22), but it does have almost-invariant vectors: Given C’ 
and «, choose n € N so large that C C [—n,n] and 2/,/n < e. Let 
o= 1X n2, where X,,2 is the characteristic function of [0,n?]. Then ¢ 
is a unit vector and, for c € C, we have 


n 1 n2+n 4 
leo sf Sart [Saas <e, 
ee) eae f) n 
(12.3.17) Remark. £7(H) has almost-invariant vectors if and only if £(H) has 
almost-invariant vectors (see Exercise 23). Therefore, £°(H) may be replaced 


with ¢'(H) in Theorem 12.3.1(6). (In fact, £?(H) may be replaced with 
LH), for any p € [1, 00) (see Exercise 24).) 
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(12.3.18) Proposition (4 <6). There is a left-invariant mean on L°(H) if 
and only if £°(H) has almost-invariant vectors. 


Proof. Because of Remark 12.3.17, we may replace £°(H) with £\(H). 

(<) By applying the construction of means in Example 12.3.8 to almost- 
invariant vectors in £'(H), we obtain almost-invariant means on £°(H). A 
limit of almost-invariant means is invariant (see Exercise 25). 

(=) Because the means constructed in Example 12.3.8 are dense in the 
space of all means, we can approximate a left-invariant mean by an £! func- 
tion. Vectors close to an invariant vector are almost-invariant, so £'(H) has 
almost-invariant vectors. However, there are technical issues here; one prob- 
lem is that the approximation is in the weak* topology, but we are looking for 
vectors that are almost-invariant in the norm topology. See Exercise 26 for a 
correct proof in the case of discrete groups (using the fact that a convex set 
has the same closure in both the norm topology and the weak* topology). OU 


§12.3(v). Folner sequences. 


(12.3.19) Definition. Let {F;,} be a sequence of measurable sets in H, such 
that 0 < u(Fi,) < co for every n. We say {F;,} is a Folner sequence if, for 
every compact subset C’ of H, we have 
BIN CF, 
ie dees ne Cl 2) (12.3.20) 
n—-+oco cEC uF, ) 


where js is the Haar measure on H. 
(12.3.21) Example. 


1) If F, = B,(0) is the ball of radius n in R*, then {F,,} is a Folner 
sequence in R* (see Exercise 29). 


2) The free group Fz on 2 generators does not have Fglner sequences 
(see Exercise 12.442). 


The reason that R‘ has a Fglner sequence, but the free group F2 does 
not, is that R° is amenable, but Fe is not: 


(12.3.22) Proposition (6 = 7). There is an invariant mean on LH) if and 
only if H has a Folner sequence. 


Proof. (<) Normalized characteristic functions of Folner sets are almost 
invariant vectors in £'(H) (see Exercise 30). 

(=) Let us assume H is discrete. Given € > 0, and a finite subset C of H, 
we wish to find a finite subset F’ of H, such that 


#(F Ac(F)) 
#(Fn) 


Since H is amenable, we know £'(H) has almost-invariant vectors (see Re- 
mark 12.3.17); hence, there exists f € £(H), such that 


Lyf 20: 


<e forallceCd. 


12.3. OTHER CHARACTERIZATIONS OF AMENABILITY 243 


2) ||fll =1, and 
3) |lef — fll1 < €/#C, for every cE C. 
Note that if f were the normalized characteristic function of a set F’, then 
this set F’ would be what we want; for the general case, we will approximate 
f by a sum of such characteristic functions. 
Approximating f by a step function, we may assume f takes only finitely 
many values. Hence, there exist: 


e finite subsets Ay C Ao C--- C A, of A, and 
e real numbers aj,...,Q@y > 0, 
such that 
1) ay +aq+-+-+Q, = 1 and 
2) f=aifi +oafet---Onfn, 


where f; is the normalized characteristic function of A; (see Exercise 33). For 
all 7 and j, and any c € H, we have 


A; \ cA; is disjoint from cA; \ A; (123.23) 
(see Exercise 34), so, for any « € H, we have 
file) > (cfi)(@) => fiw) 2 (efi) (@) 
and 
fila) < (cfi)(®) => f;() S (cf) (2). 


Therefore 


(ef — f)(@)| = Dol(efi — fi)(x)| for allz € H. 
Summing over H yields 


dcilleh — fill, = |lef — fll < re 


Summing over C’, we conclude that 


So ai > llefi — Filla <e 


a cE 
Since }*, a; = 1 (and all terms are positive), this implies there is some 7, such 
that 
So llefi - fill <e. 
cEC 
Hence, ||cf; — fill, < €, for every c € C, so we may let F = A;. 


(12.3.24) Remark. Folner sets provide an easy proof that subgroups of discrete 
amenable groups are amenable. 
Proof. Let 

e A be aclosed subgroup of a discrete, amenable group H, 

e C be a finite subset of A, and 
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ec>0. 


Since H is amenable, there is a corresponding F¢lner set F' in H. 
It suffices to show there is some h € H, such that FhMA is a Folner set 
in A. We have 


#F= SY) #(FN Ah) 
AheA\H 
and, letting ¢’ = e#C, we have 
(l+e)#F>#(CF)= SY) #(C(FNAN)), 
Ahe A\H 
so there must be some Ah € A\H, such that 
#:(C(F MN Ah)) < (14+ €)#(F'N Ah) 
(and FN Ah # 0). Then, letting F’ = Fh-!/N A, we have 
#(CF’) = #(C(FN Ah) < (1+) (FN Ah) = (1+) EP 
so F’ is a Folner set in A. O 


Exercises for §12.3. 


Invariant probability measures 


#1. In the setting of Example 12.3.3, show that Prob(X) is a compact, 
convex subset of C'(.X)*. 


[Hint: You may assume the Banach-Alaoglu Theorem (Proposition B7.4).] 


#2. Suppose H acts continuously on a compact, metrizable space X. There 
is an induced action of H on Prob(X) defined by 
(hapt)(A) = p(h7'A) for h € H, uw € Prob(X), and AC X. 


Show that this induced action of H on Prob(X) is continuous (with 
respect to the weak* topology on Prob(X)). 


#3. Let A be a separable subset of a Frchet space Y. Show 
a) A is second countable. 
b) If A is compact, then A is metrizable. 


[Hint: (b) Remark 12.3.4.] 


#4. Let p be a probability measure on a compact, convex subset C' of a 
Frchet space Y. The center of mass of C’ is a point c € Y, such that, 
for every continuous linear functional 4 on Y, we have 


xo) = fa) dus(a). 


Show the center of mass of js exists and is unique, and is an element 
of C. 


#5. 


#6. 


HT. 
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Give an example of a probability measure p on a Frchet space, such 
that the center of mass of js does not exist. 

[Hint: There are probability measures on R*, such that the center of mass 
is infinite.] 

Show that if p is the center of mass of a probability measure pz on 
a Frchet space Y, then p is invariant under every continuous, linear 
transformation of Y that preserves ju. 


Suppose H acts continuously on a compact, metrizable space X. Show 
that the map 
H x Prob(X): (Ah, 1) > haw 


defines a continuous action of H on Prob(X). 


Left-invariant means 


#8. 


#9. 


4£10. 


#11. 


412. 


4613. 


4614, 


Verify Remark 12.3.7. 

[Hint: (1H) = 1 implies ||A|| > 1. For the other direction, note that if 
| flloo <1, then ly — f > 0 a.e., so A(Ly — f) > 0; similarly, A(f + 1) > 0.] 
Show that the restriction of a mean is a mean. More precisely, let ); 
and \ be linear subspaces of £°(H), with ly € VY, C Vo. Show that 
if A is a mean on Ys, then the restriction of \ to Y, is a mean on Yj. 


Suppose A is a mean on Chaa(H), the space of bounded, continuous 
functions on H. For f € Cpaa(H), show 


min f < A(f) < max f. 


For h € H, define 06;,: Cpaa( Hf) +R by on(f) = f Ghai): Show dp, is a 
mean on Chga(). 


Let B be any linear subspace of £°(H), such that B contains lq and 
is closed in the £°-norm. Show that the means constructed in Exam- 
ple 12.3.8 are weak* dense in the set of all means on B. 


[Hint: If not, then the Hahn-Banach Theorem implies there exist « > 0, a 
mean A, and some f € (6*)* = B, such that 


A> et f Fleldy 


for every unit vector ¢ in £'(H). This contradicts the fact that A(f) < 
ess. sup f.| 
Let M be the set of means on Chaa(H). Show: 
a) M90. 
b) M is convex. 
c) M is H-invariant. 
[Hint: (a) Evaluation at any point is a mean.| 
Let M be the set of means on Chaa(H). Show: 


a) M is contained in the closed unit ball of Cyaa(H)*. (That is, we 
have |A(f) < ||flloo for every f € Craa(H).) 
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b) (MM) is weak* closed. 
c) M is compact in the weak* topology. 
[Hint: You may assume the Banach-Alaoglu Theorem (Proposition B7.4).] 
Show that if H is amenable, then there is a left-invariant mean on 
LEE: 
(Hint: Define \(f) = wo(f *7), where Ao is a left-invariant mean on Cpaa(H), 
and 7 is a nonnegative function of integral 1.] 


Suppose w: Y + X is continuous, and A is a mean on Chga(Y). Show 
that w,. (defined by (w,A)(f) = A(f ow)) is a mean on Cyaga(X). 


Assume H is amenable and discrete. Show there is a bi-invariant mean 
on L°( A). 

[Hint: Since £°(H) = Cyaa(H), amenability implies there is a left-invariant 
mean on £°(H) (see Theorem 12.3.1(4)). Now H acts by right translations 
on the set of all such means, so amenability implies that some left-invariant 
mean is right-invariant.| 


(harder) Assume H has a fixed point in every metrizable, nonempty, 
compact, convex H-space (and H is second countable). Show H is 
amenable. 

[Hint: To find a fixed point in C, choose some co € C. For each mean A on 
Cpaa(H) and each p € Y*, define ¢,(p) = A(h + p(heo)), so dy € (V*)’, the 
algebraic dual of y*. If \ is a convex combination of evaluations at points 
of H, it is obvious there exists c, € C, such that ¢,(p) = p(c,). Since the 
map A +> ¢) is continuous (with respect to appropriate weak topologies), 
this implies c, exists for every A. The proof of Proposition 12.3.9 shows that 
A may be chosen to be left-invariant, and then c) is H-invariant.] 


Invariant finitely additive probability measures 


4£18. 


419. 


Verify that p, as defined in the proof of Proposition 12.3.12(=-), has 
the properties specified in the statement of the proposition. 


Let p and X be as in the proof of Proposition 12.3.12(<). 


) If So aiXn, = > —d; Xx, ae., show 5 a; p(Ei) = S~ bjp(F)). 
i=1 j=l i=1 
b) If So aiXz, < 45d; Xx, ae., show S~ aj; p(Ei) < S> bj p(F)). 


= j=l i=1 j=l 
c) Show that \(1q) =1. 

d) Show that if f > 0, then A(f) > 0. 

e) Show that 


AU) =n | real )] 12 ake wef. 


f) Show that » is a mean on £°(H). 


[Hint: (a,b) By passing to a refinement, arrange that {£;} are pairwise dis- 
joint, {Fj} are pairwise disjoint, and each F; is contained in some F7. | 


420. 


#21. 
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Use Proposition 12.3.12 to prove that every subgroup A of a discrete 
amenable group H is amenable. 

[Hint: Let X be a set of representatives of the right cosets of A in H, and let 
A be a left-invariant finitely additive probability measure on H. For EF C A, 
define A(EZ) = X(EX).| 


Use Proposition 12.3.12 to prove that every closed subgroup A of an 
amenable group H is amenable. 


[Hint: Let X be a Borel set of representatives of the right cosets of A in H, 
and define X as in the solution of Exercise 20. Fubini’s Theorem implies that 
if & has measure 0 in A, then XA has measure 0 in H. You may assume 
(without proof) the fact that if f: M — N is a continuous function between 
manifolds M and N, and FE is a Borel subset of M, such that the restriction 
of f to E is one-to-one, then f(£) is a Borel set in N. | 


Almost-invariant vectors 


#22. 


423. 


2A. 


425. 


426. 


a) For v € £°(A), show that v is invariant under translations if and 
only if v is constant (a.e.). 

b) Show that H is compact if and only if 27H) has a nonzero vector 
that is invariant under translation. 


Show that £?(H) has almost-invariant vectors if and only if £(H) has 
almost-invariant vectors. 


[Hint: Note that f? — 9° = (f —g9)(f +4), so ||f? —97|l1 < lf —gll2 lf +9ll2- 
Conversely, for f, g > 0, we have (f—g)” < |f?—g?|, so ||f—gll3 < ||f?-97 |l1 J 


For p € [1,00), show that £1(H) has almost-invariant vectors if and 
only if £1) has almost-invariant vectors. 

[Hint: If p < q, then almost-invariant vectors in £”(H) yield almost-invariant 
vectors in £4(H), because |(f — g)|*/? < |f2/? — g?/”|. And almost-invariant 


vectors in £?(#) yield almost-invariant vectors in £? / *(H), by the argument 
of the first hint in Exercise 23.] 


Let 
e {C,,} be an increasing sequence of compact subsets of H, such that 
OF Cn = A, 
e €, =1/n, 
e ¢, be an (€,,C,)-invariant unit vector in £1(H), 
e \,, be the mean on £%° obtained from ¢, by the construction in 
Example 12.3.8, and 
e \ be an accumulation point of {A,}. 
Show that A is invariant. 


Assume #H is discrete. Let 
P={¢€ L1H) |¢>0, [ld], = 1}. 


Suppose {¢;} is a net in P, such that the corresponding means \,; 
converge weak* to an invariant mean on L°(H). 
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a) For each h € H, show that the net {h*¢; — ¢;} converges weakly 
to 0. 
b) Take a copy £' HA), of L(A) for each h € H, and let 


VS BA. 
heH 


with the product of the norm topologies. Show that Y is a Frchet 
space. 
c) Show that the weak topology on Y is the product of the weak 
topologies on the factors. 
) Define a linear map T: £'(H) > Vv by T(f)n = h* f — f. 
) Show that the net {7'(¢;}) converges to 0 weakly. 
) Show that 0 is in the strong closure of T(P). 
g) Show that £'(H) has almost-invariant vectors. 


#27. Show that if H is amenable, then H has the Haagerup property. By 


definition, this means there is a unitary representation of H on a Hilbert 
space H, such that there are almost-invariant vectors, and all matrix 
coefficients decay to 0 at oo as in the conclusion of Theorem 11.2.2. (A 
group with the Haagerup property is also said to be a- T-menable.) 


Folner sequences 


4£28. 


4£29. 


430. 


#31. 


#32. 


#33. 


Show that {F;,} is a Folner sequence if and only if, for every compact 
subset C' of H, we have 


Justify Example 12.3.21(1). 
[Hint: C C B,(0), for some r. We have ju(B,+0(0)) /u(Be(0)) > 1] 


Prove Proposition 12.3.22(<). 


[Hint: Normalizing the characteristic function of F;, yields an almost-invariant 
unit vector. ] 


Show (12.3.20) is equivalent to 


pil, UcF,,) 
lim max —————— = 1. 

n—-oco cEC b( Fy, ) 
Assume #1 is discrete. Show that a sequence {F;,} of finite subsets of H 
is a Folner sequence if and only if, for every finite subset C' of H, we 


have 


Given a step function f, as in the proof of Proposition 12.3.22(=), let 
ea, > a2 >---:> Gy be the finitely many positive values taken by f, 
e A;={heH | f(h) >a; }, and 
e f; be the normalized characteristic function of A;. 

Show 
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a) Ai © Ap C--- C Ap, 
b) there exist real numbers a1,...,@, > 0, such that 
f=afit---+oanfn, 


and 
Cc) ay +--+ +a, = 1. 
#34. Prove (34). 
[Hint: Note that either A; C A; or A; C Aj.] 
#35. (harder) Use Folner sets to prove Remark 12.3.24 (without assuming H 
is discrete). 
[Hint: Adapt the proof of the discrete case. There are technical difficulties, 
but begin by replacing the sum over A\H with an integral over A\H.| 


#36. A finitely generated (discrete) group A is said to have subexponential 
growth if there exists a generating set S for A, such that, for every 
e>0, 


#(S US~1)" < e® for all large n. 


Show that every group of subexponential growth is amenable. 


#37. Give an example of an finitely generated, amenable group that does not 
have subexponential growth. 


§12.4. Some nonamenable groups 
Other proofs of the following proposition appear in Exercises 1 and 2. 
(12.4.1) Proposition. Nonabelian free groups are not amenable. 


Proof. For convenience, we consider only the free group F2 on two gener- 
ators a and b. Suppose Fz has a left-invariant finitely additive probability 
measure p. (This will lead to a contradiction.) 

We may write Fy = AUA”- UBUB™ U {fe}, where A, A~, B, and B™ 
consist of the reduced words whose first letter is a, a~', b, or b~1, respectively. 
Assume, without loss of generality, that p(AUA7~) < p(BUB7) and p(A) < 
p(A~). Then 


p(BUB™ U fe}) > ; and p(A)< 
Then, by left-invariance, we have 

p(a(B UB U {e})) =o BUBB Ute) ; > p(A). 
This contradicts the fact that a(BUB™ U {e}) C A. 
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Combining this with the fact that subgroups of discrete amenable groups 
are amenable (see Proposition 12.2.8), we have the following consequence: 


(12.4.2) Corollary. Suppose H is a discrete group. If H contains a non- 
abelian, free subgroup, then H is not amenable. 


(12.4.3) Remarks. 


1) The converse of Corollary 12.4.2 is known as “von Neumann’s Conjec- 
ture? but it is false: a nonamenable group with no nonabelian free 
subgroups was found by Ol’shanskii in 1980. (The name is misleading: 
apparently, the conjecture is due to M. Day, and was never stated by 
Von Neumann.) 


2) The assumption that H is discrete cannot be deleted from the state- 
ment of Corollary 12.4.2. For example, the orthogonal group SO(3) 
is amenable (because it is compact), but the Tits Alternative (4.9.1) 
implies that it contains nonabelian free subgroups. 


3) The nonamenability of nonabelian free subgroups of SO(3) is the basis 
of the famous Banach-Tarski Paradox: A 3-dimensional ball B can 
be decomposed into finitely many subsets Xj,..., Xn, such that these 
subsets can be reassembled to form the union of two disjoint balls of the 
same radius as B. (More precisely, the union B, U Bz of two disjoint 
balls of the same radius as B can be decomposed into subsets Y1,..., Yn, 
such that Y; is congruent to X;, for each 7.) 


4) If H contains a closed, nonabelian, free subgroup, then H is not 
amenable. 


Here is an example of a nonamenable connected group: 
(12.4.4) Proposition. SL(2,R) is not amenable. 


Proof. Let G = SL(2,R). The action of G on RU {oo} & S!? by linear- 
fractional transformations is transitive, and the stabilizer of the point 0 is the 
subgroup P = [* 3], so G/P is compact. However, the Borel Density Theo- 
rem implies there is no G-invariant probability measure on G/P (see Exer- 
cise 4.6#2). (See Exercise 4 for a direct proof that there is no G-invariant 


probability measure.) So G is not amenable. O 


More generally: 


(12.4.5) Proposition. If a connected, semisimple Lie group G is not compact, 
then G is not amenable. 


Proof. The Jacobson-Morosov Lemma (A5.8) tells us that G contains a closed 
subgroup isogenous to SL(2,R). Alternatively, recall that any lattice [ in G 
must contain a nonabelian free subgroup (see Corollary 4.9.2), and, being 
discrete, this is a closed subgroup of G. O 
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(12.4.6) Remark. Readers familiar with the structure of semisimple Lie groups 
will see that the proof of Proposition 12.4.4 generalizes to the situation of 
Proposition 12.4.5: Since G is not compact, it has a proper parabolic sub- 
group P. Then G/P is compact, but the Borel Density Theorem implies that 
G/P has no G-invariant probability measure. 


Combining this result with the structure theory of connected Lie groups 
yields the following classification of connected, amenable Lie groups: 


(12.4.7) Proposition. A connected Lie group H is amenable if and only if 
H contains a connected, closed, solvable normal subgroup N, such that H/N 
is compact. 


Proof. (<=) Corollary 12.2.7(2). 

(=) The structure theory of Lie groups tells us that there is a connected, 
closed, solvable, normal subgroup R of H, such that H/R is semisimple. (The 
subgroup R is called the radical of H.) Since quotients of amenable groups 
are amenable (see Exercise 12.1442), we know that H/R is amenable. So H/R 
is compact (see Proposition 12.4.5). 


Exercises for §12.4. 


#1. a) Find a homeomorphism ¢ of the circle S', such that the only ¢- 
invariant probability measure is the delta mass at a single point p. 
b) Find two homeomorphisms ¢, and ¢2 of S, such that the subgroup 
(¢1, 62) they generate has no invariant probability measure. 
c) Deduce that the free group F2 on 2 generators is not amenable. 
[Hint: (a) Identifying S* with [0, 1], let ¢(x) = «?. For any x € (0,1), we have 
6((0,x)) = (0,27), so ((2?,x)) = 0. Since (0,1) is the union of countably 
many such intervals, this implies that 1((0, 1) = 0. 


#2. Show explicitly that free groups do not have F¢lner sequences. More 
precisely, let Fz be the free group on two generators a and b, and show 
that if F' is any nonempty, finite subset of F2, then there exists c € 
{a,b,a~1,b~1}, such that #(F \ cF) > (1/4)#F. This shows that F2 
free groups is not amenable. 

[Hint: Suppose F = AU BUA UB’, where words in A, B,A~,B™ start 
with a,b,a~',b~", respectively. If #A < #A™ and #(AUA7) < #(BUB_), 
then #(aF \ F) > #(BUB_)- #A\| 


#3. Assume that H is discrete, and that H is isomorphic to a (not neces- 
sarily discrete) subgroup of SL(@,R). Show: 
a) H is amenable if and only if H has no nonabelian, free subgroups. 
b) H is amenable if and only if H has a solvable subgroup of finite 
index. 
[Hint: Tits Alternative (4.9.1).] 
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#4. Let G = SL(2,R) act on RU {00} by linear-fractional transformations, 
as usual. 
a) For u = [d+] € G, show that the only u-invariant probability 
measure on R U {oo} is concentrated on the fixed point of wu. 
b) Since the fixed point of u is not fixed by all of G, conclude that 
there is no G-invariant probability measure on R U {oo}. 


[Hint: (a) The action of u is conjugate to the homeomorphism ¢ in the hint 
to Exercise 1(a), so a similar argument applies. 


#5. Show that if a semisimple Lie group G is not compact, then every 
lattice [ in G is not amenable. 


#6. Give an example of a nonamenable Lie group that has a closed, cocom- 
pact, amenable subgroup. (By Proposition 12.4.7, the subgroup cannot 
be normal.) 


§12.5. Closed subgroups of amenable groups 


Before proving that closed subgroups of amenable groups are amenable 
(Proposition 12.2.8), we introduce some notation and establish a lemma. 
(Proofs for the case of discrete groups have already been given in Re- 
marks 12.3.13 and 12.3.24.) 


(12.5.1) Notation. 


1) We use £°(H;C) to denote the space of all measurable functions from 
the Lie group H to the compact, convex set C’, where two functions 
are identified if they are equal a.e. (with respect to the Haar measure 
on #). 

2) If A is a closed subgroup of H, and C is a A-space, then 

00 se is essentiall 
LMA C) = \¥ € LAC) | : eae \. 
(To say w is essentially A-equivariant means, for each A € A, that 
W(Ah) = A- V(h) for ae. h € H.) 


(12.5.2) Examples. 


1) Suppose #H is discrete. Then every function on H is measurable, so 
L°(H;C) = C* is the cartesian product of countably many copies 
of C. Therefore, in this case, Tychonoft’s Theorem (B5.2) implies that 
L°(H;C) is compact. 

2) If C is the closed unit disk in the complex plane (and H is arbitrary), 
then £°°(.H; C) is the closed unit ball in the Banach space £°°(H), so the 
Banach-Alaoglu Theorem (Proposition B7.4) states that it is compact 
in the weak* topology. 


More generally, if we put a technical restriction on C,, then there is a weak 
topology on £°(H;C’) that makes it into a compact, convex H-space: 
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(12.5.3) Lemma. Assume 

e A is a closed subgroup of H, 

e C is a nonempty, compact, convex H-space, and 

e C is contained in the dual of some separable Banach space B. 
Then £L°(H;C) and £LX(H;C) are nonempty, compact, convex H -spaces. 


Proof. Let £°(H;B*) be the space of all bounded measurable functions 
from H to B* (where two functions are identified if they are equal a.e.). This 
is the dual of the (separable) Banach space £1(H; B), so it has a natural weak* 
topology. Since £°(H;C) is a closed, bounded, convex subset of £°°(H; B*), 
the Banach-Alaoglu Theorem (B7.4) tells us that it is weak* compact. In 
addition, the action of H by right-translation on £°(H;C) is continuous 
(see Exercise 1). 

It is not difficult to see that L3{H;C) is a nonempty, closed, convex, 
H-invariant subset (see Exercise 3). 


Proof of Proposition 12.2.8. Let A be a closed subgroup of an amenable 
Lie group H. Given any continuous action of A on a compact, metrizable 
space X, it suffices to show there is a A-invariant probability measure on X 
(see Theorem 12.3.1(3)). From Lemma 12.5.3, we know that £°(H; Prob(X)) 
is a nonempty, compact, convex H-space. Therefore, the amenability of H 
implies that H has a fixed point ~ in LY(H;C). So yw is essentially H- 
invariant. If we fix any A € A, then, for a.e. h € H, we have 
A+ w(h) = w(Ah) (q is essentially A-equivariant) 
= (h) (q is essentially H-invariant). 

If we assume, for simplicity, that A is countable (see Exercise 4), then the 
quantifiers can be reversed (because the union of countably many null sets is a 
null set), so we conclude that the probability measure 7(h) is A-invariant. 


Exercises for §12.5. 


#1. Show that the action of H on £°(H;C) by right translations is contin- 
uous in the weak*-topology. 


#2. Suppose A is a closed subgroup of H, and that A acts measurably on 
a measure space 2. Show there is a A-equivariant, measurable map 
w: HQ. 
[Hint: w can be defined arbitrarily on a strict fundamental domain for A 
in H.| 

#3. Show that £Y(H;C) is a nonempty, closed, convex, H-invariant subset 
ObE GA C): 

#4. Prove Proposition 12.2.8 without assuming A is countable. 
[Hint: Consider 4 in a countable dense subset of A.] 
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§12.6. Equivariant maps from G/P to Prob(X) 


We now use amenability to prove a basic result that has important conse- 
quences for the theory of arithmetic groups. In particular, it is an ingredi- 
ent in two fundamental results of G. A. Margulis: his Superrigidity Theorem 
(16.1.6) and his Normal Subgroups Theorem (17.1.1). 


(12.6.1) Proposition (Furstenberg’s Lemma). Jf 
e P is a closed, amenable subgroup of G, and 
e T acts continuously on a compact metric space X , 


then there is a Borel measurable map w: G/P — Prob(X), such that w is 
essentially T-equivariant. 


Proof. Lemma 12.5.3 tells us that L2(G; Prob(X)) is a nonempty, compact, 
convex G-space. By restriction, it is also a nonempty, compact, convex P- 
space, so P has a fixed point wo (under the action by right-translation). Then 
Wo factors through to an (essentially) well-defined map 7: G/P — Prob(X). 
Because Wo is [-equivariant, it is immediate that ~w is [-equivariant. L 


In applications of Proposition 12.6.1, the subgroup P is usually taken to 
be a minimal parabolic subgroup. Here is an example of this: 


(12.6.2) Corollary. If 
e G=SL(3,R), 
e P= ie | CG, and 
* kK OK 
e T acts continuously on a compact metric space X , 
then there is a Borel measurable map ~: G/P — Prob(X), such that w is 
essentially T-equivariant. 


Proof. P is amenable (see Exercise 12.2414). O 


(12.6.3) Remark. The function w that is provided by Furstenberg’s Lemma 
(12.6.1) (or Corollary 12.6.2) can be thought of as being a “random” map 
from G/P to X; for each z € G/P, the value of w(z) is a probability distri- 
bution that defines a random value for the function at the point z. However, 
we will see in Section 16.7 that the theory of proximality makes it possible to 
show, in certain cases, that 7(z) is actually a single well-defined point of X, 
not a random value that varies over some range. 


Exercises for §12.6. 


#1. Show that every minimal parabolic subgroup of G is amenable. 
[Hint: Langlands decomposition (8.4.6).] 
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§12.7. More properties of amenable groups (optional) 


In this section, we mention (without proof, and without even defining all of 
the terminology) a variety of very interesting properties of amenable groups. 
For simplicity, 


we assume A is a discrete group. 


§12.7(i). Bounded harmonic functions. 


(12.7.1) Definition. Fix a probability measure jz on A. 


1) A function f: A > R is p-harmonic if f = uw * f. This means, for 
every A€ A, 


FA) = S— (a) f (aa). 


cen 
2) wis symmetric if u(A~*) = u(A) for every AC A. 
(12.7.2) Theorem. A is amenable if and only if there exists a symmetric 
probability measure on A, such that 
1) the support of uw generates A, and 
2) every bounded, p-harmonic function on A is constant. 
Because any harmonic function is the Poisson integral of a function on the 


Poisson boundary (and vice-versa), this result can be restated in the following 
equivalent form: 


(12.7.3) Corollary. A is amenable if and only if there exists a symmetric 
probability measure pp on A, such that 
1) the support of uw generates A, and 


2) the Poisson boundary of A (with respect to 1) consists of a single point. 


§12.7(ii). Norm of a convolution operator. 


(12.7.4) Definition. For any probability measure yz on A, there is a corre- 
sponding convolution operator C,, on £7(A), defined by 


(Cuf)) = >> ule) fa). 
cel 


(12.7.5) Theorem. Let « be any probability measure on A, such that the 
support of p generates A. Then ||C,,|| = 1 if and only if A is amenable. 
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§12.7(iii). Spectral radius. In geometric terms, the following famous 
result characterizes amenability in terms of the spectral radius of random 
walks on Cayley graphs. 


(12.7.6) Theorem (Kesten). Let yw be a finitely supported, symmetric prob- 
ability measure on A, such that the support of w generates A. Then ps is 
amenable if and only if 
1/2n 
a (gr) H(g2) + (Gon) 
mim, | g1,-..sGan € supp pt 
9192 °° * Jan = € 


ol 


§12.7(iv). Positive-definite functions. 


(12.7.7) Definition (cf. Terminology 13.6.4). A C-valued function y on A is 
positive-definite if, for all a,,...,a, € A, the matrix 
laijej—1 = [p(a;*a;)] 


is Hermitian and has no negative eigenvalues. 


(12.7.8) Theorem. A is amenable if and only if eA y(g) > 0 for every 
(finitely supported) positive-definite function p on A. 


§12.7(v). Growth. 


(12.7.9) Definition. Assume A is finitely generated, and fix a symmetric 
generating set S for A. 


1) For each r € Z™, let B,(A) be the ball of radius r centered at e, More 
precisely, 
B,(A;S) ={XAEA | As1, 59,...,8, E SU {e}, A = 8159---S, }. 


2) We say A has subexponential growth if for every « > 0, we have 
#B,(A;S') < e°, for all sufficiently large r € ZT. 


(12.7.10) Proposition (see Exercise 1). If A has suberponential growth, then 
A is amenable. 


(12.7.11) Warning. The implication in Proposition 12.7.10 goes only one 
direction: there are many groups (including many solvable groups) that are 
amenable, but do not have subexponential growth (see Exercise 2). 
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§12.7(vi). Cogrowth. 


(12.7.12) Definition. Assume A is finitely generated. Let: 

1) S = {s1,52,...,5%} be a finite generating set of A. 

2) Fy be the free group on & generators £1,..., 2x. 

3) @g: Fy, + A be the homomorphism defined by $(2;) = 5;. 
The cogrowth of A (with respect to S$’) is 


. dl 4 4 
au r logo,1 #((ker os) N By (Fr; eee ee) 


Note that #B,.(Fy; ie daca lig =) is equal to the number of reduced words 
of length r in the symbols af’,... ae which is approximately (2k — 1)". 
Therefore, it is easy to see that that the cogrowth of A is between 0 and 1 
(see Exercise 3). The maximum value is obtained if and only if A is amenable: 


(12.7.13) Theorem. A is amenable if and only if the cogrowth of A is 1, with 
respect to some (or, equivalently, every) finite generating set S. 


§12.7(vii). Unitarizable representations. 
(12.7.14) Definition. Let p: A — B(H) be a (not necessarily unitary) rep- 
resentation of A on a Hilbert space H. 


1) pis uniformly bounded if there exists C' > 0, such that ||p(A)|| < C, 
for all A € A. 


2) pis unitarizable if it is conjugate to a unitary representation. This 
means there is an invertible operator T’ on H, such that the represen- 
tation AW T7! p(A) T is unitary. 


It is fairly obvious that every unitarizable representation is uniformly 
bounded (see Exercise 4). The converse is not true, although it holds for 
amenable groups: 


(12.7.15) Theorem. Jf A is amenable, then every uniformly bounded repre- 
sentation of A is unitarizable. 


(12.7.16) Remark. The converse of Theorem 12.7.15 is an open question. 


§12.7(viii). Almost representations are near representations. 
(12.7.17) Definition. Fix « > 0, and let y be a function from A to the group 
U(H) of unitary operators on a Hilbert space H. 

1) v is €-almost a unitary representation if 


Ile(AzA2) _ p(A1) p(A2)|| < € for all Ai, AQ EA. 
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2) y is €-near a unitary representation if there exists a unitary represen- 
tation p: A: U(H), such that 


|~(A) — p(A)|| < € for every A € A. 
For amenable groups, every almost representation is near a representation: 


(12.7.18) Theorem. Assume A is amenable. Given € > 0, there exists 6 > 0, 
such that if yp is 6-almost a unitary representation, then — is €-near a unitary 
representation. 


§12.7(ix). Bounded cohomology. The bounded cohomology groups 
of A are defined just like the ordinary cohomology groups, except that all 
cochains are assumed to be bounded functions. 

(12.7.19) Definition. Assume B is a Banach space. 


1) Bisa Banach A-module if A acts continuously on B, by linear isome- 
tries. 


2) Zbaa(A; B) = 2(A; B) NO L(A; B). 

3) Hbaa(A; B) = Zbaa(A; B)/ B'(A; B). 
(12.7.20) Theorem. A is amenable if and only if Hiaa(A; B) = 0 for every 
Banach A-module B. 


(12.7.21) Remark. In fact, if A is amenable, then Hqq(A; B) = 0 for all n. 


§12.7(x). Invariance under quasi-isometry. 


(12.7.22) Proposition (see Exercise 6). Assume A; and Ag are finitely gen- 
erated groups, such that Ay is quasi-isometric to Ag (see Definition 10.1.8). 
Then A, is amenable if and only if Ag is amenable. 


§12.7(xi). Ponzi schemes. Assume A is finitely generated, and let d 
be the word metric on A, with respect to some finite, symmetric generating 
set S (see Definition 10.1.1). 


(12.7.23) Definition. A function f: A — A is a Ponzi scheme on A if 
there is some C' > 0, such that, for all A € A, we have: 

1) #f7-1(\) > 2, and 

2) d(f(A), A) < C. 


(12.7.24) Theorem. A is amenable if and only if there does not exist a Ponzi 
scheme on A. 
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Exercises for §12.7. 


#1. Prove Proposition 12.7.10. 
[Hint: If no balls are Fglner sets, then the group has exponential growth.] 


#2. Choose a prime number p, and let 


A={ i 7) | kmnez} C SL(2, Q), 


with the discrete topology. Show A is an amenable group that does not 
have subexponential growth. 


#3. In the notation of Definition 12.7.12, show: 
) SB ie Se cge = DhORS 1), 
b) If cog A is the cogrowth of A, then 0 < cogA < 1. 


#4. Show that every unitarizable representation is uniformly bounded. 


#5. For every € > 0, show there exists 6 > 0, such that if y is d-near a 
unitary representation, then vy is e-almost a unitary representation. 


#6. Prove Proposition 12.7.22. 


[Hint: Show that if A is not amenable, then, for every k, it has a finite 
subset S, such that #(SF) > k-#F for every finite subset F' of A.| 


#7. Explicitly construct a Ponzi scheme on the free group with two gener- 
ators. 


#8. Show (without using Theorem 12.7.24) that if A is amenable, then there 
does not exist a Ponzi scheme on A. 
[Hint: Fglner sets.] 


Notes 


The notion of amenability is attributed to J. von Neumann [20], but he 
used the German word “messbar” (which can be translated as “measurable” ). 
The term “amenable” was apparently introduced into the literature by M. Day 
[4, #507, p. 1054] in the announcement of a talk. 

The monographs [16, 17] are standard references on amenability. Briefer 
treatments are in [2, App. G], [6], and [22, §4.1]. Quite a different approach 
to amenability appears in [21, Chaps. 10-12] (for discrete groups only). 

The fact that closed subgroups of amenable groups are amenable (Propo- 
sition 12.2.8) is proved in [6, Thm. 2.3.2, pp. 30-32], [17, Prop. 13.3, p. 118], 
and [22, Prop. 4.2.20, p. 74]. 

See [6, p. 67] for a proof of Proposition 12.3.22(=-) that does not require 
Hf to be discrete. 

Remark 12.3.17 is proved in [6, pp. 46-47]. 

The solution of Exercise 12.3417 can be found in {17, Thm. 5.4, p. 45]. 
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For a proof of the fact (mentioned in the hint to Exercise 12.3421) that 
the one-to-one continuous image of a Borel set is Borel, see [1, Thm. 3.3.2, 
p. 70]. 

Our proof of Proposition 12.3.22(=) is taken from [6, pp. 66-67]. 

Remark 12.4.3(1), the existence of a nonamenable group with no non- 
abelian free subgroup, is due to Olshanskii [15]. (In this example, called 
an “Olshanskii Monster” or “Tarski Monster? every proper subgroup of the 
group is a cyclic group of prime order, so there is obviously no free subgroup.) 
A much more elementary example has recently been constructed by N. Monod 
[13]. 

The book of S. Wagon [21] is one of the many places to read about the 
Banach-Tarski Paradox (Remark 12.4.3(3)). 

Furstenberg’s Lemma (12.6.1) appears in [5, Thm. 15.1]. Another proof 
can be found in [22, Prop. 4.3.9, p. 81]. 

Theorem 12.7.2 is due to Kaimanovich and Vershik [10, Thms. 4.2 and 4.4] 
and (independently) Rosenblatt [19, Props. 1.2 and 1.9 and Thm. 1.10]. 

Theorem 12.7.5 is due to H. Kesten (if 4 is symmetric). See [2, G.4.4] for 
a proof. 

A proof of Proposition 12.7.10 can be found in [17, Props. 12.5 and 12.5]. 

Theorem 12.7.13 was proved by R.I. Grigorchuk [8] and J. M. Cohen [3] 
(independently). 

Theorem 12.7.15 was proved by J. Dixmier and M. Day in 1950 (indepen- 
dently). See [18] for historical remarks and progress on the converse. (Another 
result on the converse is proved in [14].) 

Theorem 12.7.18 is due to D. Kazhdan [11]. 

Theorem 12.7.20 and Remark 12.7.21 are due to B.E. Johnson [9]. See 
[12] (and its many references) for an introduction to bounded cohomology. 

Theorem 12.7.24 appears in [7, 6.17 and 6.173, p. 328]. 
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Chapter 13 


Kazhdan’s Property (T’) 


Recall that if a Lie group H is not amenable, then £°H) does not have 
almost-invariant vectors (see Theorem 12.3.1(6)). Kazhdan’s property (T) is 
the much stronger condition that no unitary representation of H has almost- 
invariant vectors (unless it has a vector that is fixed by H). Thus, in a sense, 
Kazhdan’s property is the antithesis of amenability. 

We already know that [ is not amenable (unless it is finite) (see Ex- 
ercise 12.445). In this chapter, we will see that [ usually has Kazhdan’s 
Property (TJ), and we will look at some of the consequences of this. 


§13.1. Definition and basic properties 


Part (1) of the following definition is repeated from Definition 12.3.14, but 
the second half is new. 


(13.1.1) Definition. Let H be a Lie group. 
1) An action of H on a normed vector space B has almost-invariant 
vectors if, for every compact subset C of H and every « > 0, there is a 
unit vector v € 6, such that 


lcu—v||<e forallc EC. (13.1.2) 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: Unitary representations (Sections 11.1, 
11.3, and 11.5). 
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(A unit vector satisfying (13.1.2) is said to be (€, C)-invariant.) 


2) H has Kazhdan’s property (T) if every unitary representation of H 
that has almost-invariant vectors also has (nonzero) invariant vectors. 


We often abbreviate “Kazhdan’s property (7')” to “Kazhdan’s property’ Also, 
a group that has Kazhdan’s property is often said to be a Kazhdan group. 


(13.1.3) Warning. By definition, unitary representations are actions on 
Hilbert spaces, so Kazhdan’s property says nothing at all about actions on 
other types of topological vector spaces. In particular, there are actions of 
Kazhdan groups by norm-preserving linear transformations on some Banach 
spaces that have almost-invariant vectors, without having invariant vectors 
(see Exercise 1). On the other hand, it can be shown that there are no such 
examples on £? spaces (with 1 < p < ov). 


(13.1.4) Proposition. A Lie group is compact if and only if it is amenable 
and has Kazhdan’s property. 


Proof. Exercises 2 and 3. O 


(13.1.5) Corollary. A discrete group A is finite if and only if it is amenable 
and has Kazhdan’s property. 


(13.1.6) Example. Z” does not have Kazhdan’s property, because it is a 
discrete, amenable group that is not finite. 


(13.1.7) Proposition. Jf A is a discrete group with Kazhdan’s property, then: 
1) every quotient A/N of A has Kazhdan’s property, 
2) the abelianization A/|A, A] of A is finite, and 
3) A is finitely generated. 


Proof. For (1) and (2), see Exercises 4 and 6. 

(3) Let {A,,} be the collection of all finitely generated subgroups of A. We 
have a unitary representation of A on each £°(A/A,), given by (yf)(vAn) = 
f(y~'wvAn). The direct sum of these is a unitary representation on 

H = L7(A/A1) B L{A/A2) O- +. 

Any compact set C' C A is finite, so we have C’ C A, for some n. Then 
C fixes the base point p = A,,/A,, in A/Ajy, so, letting f = 6, be a nonzero 
function in £7 A/A,,) that is supported on {p}, we have yf = f for all y € C. 
Therefore, H has almost-invariant vectors, so there must be an H-invariant 
vector in H. 

So some £°(A/A,,) has an invariant vector. Since A is transitive on A/Ap, 
an invariant function must be constant. So a (nonzero) constant function is 
in £°(A/A,,), which means A/A,, is finite. Because A,, is finitely generated, 
this implies that A is finitely generated. L] 
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Since the abelianization of any (nontrivial) free group is infinite, we have 
the following example: 


(13.1.8) Corollary. Free groups do not have Kazhdan’s property. 


(13.1.9) Remark. Proposition 13.1.7 can be generalized to groups that are not 
required to be discrete, if we replace the word “finite” with “compact” (see 
Exercises 5, 7, and 15). This leads to the following definition: 


(13.1.10) Definition. A Lie group H is compactly generated if there exists 
a compact subset that generates H. 


(13.1.11) Warning. Although discrete Kazhdan groups are always finitely 
generated (see Proposition 13.1.7(3)), they need not be finitely presented. 
(In fact, there are uncountably many non-isomorphic discrete groups with 
Kazhdan’s property (7), and only countably many of them can be finitely 
presented.) However, it can be shown that every discrete Kazhdan group is 
a quotient of a finitely presented Kazhdan group. 


Exercises for §13.1. 


#1. Let Co(H) be the Banach space of continuous functions on H that tend 
to 0 at infinity (with the supremum norm). Show: 
a) Co(H) has almost-invariant vectors of norm 1, but 
b) Co(#) does not have H-invariant vectors other than 0, unless H 
is compact. 
[Hint: Choose a uniformly continuous function f(h) that tends to +00 as 
h leaves compact sets. For large n, the function h ++ n/(n + f(h)) is almost 
invariant.| 
#2. Prove Proposition 13.1.4(=). 
[Hint: If H is compact, then almost-invariant vectors are invariant.| 


#3. Prove Proposition 13.1.4(<). 


[Hint: Amenability plus Kazhdan’s property implies £7(H) has an invariant 
vector. | 


#4. Prove Proposition 13.1.7(1) 


[Hint: Any representation of A/N is also a representation of A.] 
#5. Show that if H has Kazhdan’s property, and N is a closed, normal 
subgroup of H, then H/N has Kazhdan’s property. 
#6. Prove Proposition 13.1.7(2). 
[Hint: A/[A, A] is amenable and has Kazhdan’s property. 
#7. Show that if H has Kazhdan’s property, then H/|H, H] is compact. 
#8. Show that if N is a closed, normal subgroup of H, such that N and 


H/N have Kazhdan’s property, then H has Kazhdan’s property. 
[Hint: The space of N-invariant vectors is H-invariant (why’).| 
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#9. 


4£10. 


#11. 


#12. 


4£13. 


414. 


#15. 


4£16. 
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Warning. The converse is not true: there are examples in which a nor- 
mal subgroup of a Kazhdan group is not Kazhdan (see Exercise 13.3745). 


Show that H; x H2 has Kazhdan’s property if and only if H, and H2 
both have Kazhdan’s property. 


Let (7,V) be a unitary representation of a Kazhdan group H. Show 
that almost-invariant vectors in V are near invariant vectors. More 
precisely, given € > 0, find a compact subset C' of H and 6 > 0, such 
that if v is any (6,C)-invariant vector in V, then there is an invariant 
vector vo in V, such that ||v — vol] < e. 


[Hint: There are no almost-invariant unit vectors in (V%)+, the orthogonal 
complement of the space of invariant vectors.| 


Suppose S' is a generating set of a discrete group A, and A has Kazh- 
dan’s property. Show there exists « > 0, such that if a is any unitary 
representation of A that has an (e¢, $)-invariant vector, then 7 has an 
invariant vector. (The point here is to reverse the quantifiers: the same 
€ works for every 7.) Such an € is called a Kazhdan constant for A. 


Recall that we say H has the Haagerup property if it has a unitary 
representation, such that there are almost-invariant vectors, and all 
matrix coefficients decay to 0 at oo. Show that if H is a noncompact 
group with Kazhdan’s property, then H does not have the Haagerup 
property. 
Assume: 

e vy: H, — Hz is a homomorphism with dense image, and 

e H, has Kazhdan’s property. 
Show Hy» has Kazhdan’s property. 


Show that a Lie group H is compactly generated if and only if H/H° 
is finitely generated. 


[Hint: (<) Since H° is connected, it is generated by any subset with 
nonempty interior.| 


Show that every Lie group with Kazhdan’s property is compactly gen- 
erated. 


[Hint: Either adapt the proof of Proposition 13.1.7(3), or use Proposi- 
tion 13.1.7(3) together with Exercises 5 and 14.] 


Assume [I has Kazhdan’s property (T), and S is a finite generating set 
for [. Show there exists € > 0, such that if N is any finite-index normal 
subgroup of [, and A is any subset of ['/N, then 

#(SAU A) >min{(l+e)-#A, 3|T/N|}. 
(In graph-theoretic terminology, this means the Cayley graphs Cay(I'/N;; S) 


form a family of expander graphs if Ni, N2,... are finite-index normal sub- 
groups, such that |[./N;| + co.) 
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§13.2. Semisimple groups with Kazhdan’s property 


(13.2.1) Theorem (Kazhdan). SL(3,R) has Kazhdan’s property. 


This theorem is an easy consequence of the following lemma, which will 
be proved in Section 13.3. 


(13.2.2) Lemma. Assume 


ez isa unitary representation of the natural semidirect product 


* KK 
SL(2, IR) x R? = F * | Cc SL(3, R), 
001 
and 


e a has almost-invariant vectors. 


Then 7 has a nonzero vector that is invariant under the subgroup R?. 


Other terminology. Suppose R is a subgroup of a topological group H. The 
pair (H, R) is said to have relative property (T) if every unitary representa- 
tion of H that has almost-invariant vectors must also have an R-invariant vec- 
tor. In this terminology, Lemma 13.2.2 states that the pair (SL(2, R) kK R?, R?) 
has relative property (7). 


Proof of Theorem 13.2.1. Let 
10* 
G=SL(3,R), R= ) i | o& R?, and H = SL(2,R) x R, 


and suppose 7 is a unitary representation of G that has almost-invariant 
vectors. Then it is obvious that the restriction of 7 to H also has almost- 
invariant vectors (see Exercise 1), so Lemma 13.2.2 implies there is a nonzero 
vector v that is fixed by R. Then the Moore Ergodicity Theorem (11.2.8) 
implies that v is fixed by all of G. So 7 has a fixed vector (namely, wv). 


If G is simple, and rankp G > 2, then G contains a subgroup isogenous 
to SL(2,R) x R”, for some n (cf. Exercise 2), so a modification of the above 
argument shows that G has Kazhdan’s property. On the other hand, it is 
important to know that not all simple Lie groups have the property: 


(13.2.3) Example. SL(2,R) does not have Kazhdan’s property. 


Proof. Choose a torsion-free lattice [ in SL(2, R). Then T is either a surface 
group or a nonabelian free group. In either case, ['/|I, I] is infinite, so T does 
not have Kazhdan’s property. Therefore, we conclude from Proposition 13.4.1 
below that SL(2,R) does not have Kazhdan’s property. 


Alternate proof. A reader familiar with the unitary representation theory 
of SL(2, R) can easily construct a sequence of representations in the principal 
series whose limit is the trivial representation. The direct sum of this sequence 
of representations has almost-invariant vectors. 
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We omit the proof of the following precise characterization of the semisim- 
ple groups that have Kazhdan’s property: 


(13.2.4) Theorem. G has Kazhdan’s property if and only if no simple factor 
of G is isogenous to SO(1,n) or SU(1,n). 


Exercises for §13.2. 


#1. Assume 7 is a unitary representation of H that has almost-invariant 
vectors, and L is a subgroup of H. Show that the restriction of 7 to L 
has almost-invariant vectors. 


#2. (Assumes familiarity with real roots) Assume G is simple. Show 
ranke G > 2 if and only if some connected subgroup L of G is isoge- 
nous to SL(2,R) and normalizes (but does not centralize) a nontrivial, 
unipotent subgroup U of G. 


[Hint: (=) An entire maximal parabolic subgroup of G normalizes a nontriv- 
ial unipotent subgroup. (<=) Construct two unipotent subgroups of G that 
both contain U, but generate a subgroup that is not unipotent.| 


#3. Suppose H is a closed, noncompact subgroup of G, and G is simple. 
Show that the pair (G, H) has relative property (7) if and only if G 
has Kazhdan’s property. 


#4. Suppose G has Kazhdan’s property. Show there is a compact subset C 
of G and some € > 0, such that every unitary representation of G with 
(e,C)-invariant vectors has invariant vectors. 


§13.3. Proof of relative property (T) 


In this section, we prove Lemma 13.2.2, thereby completing the proof that 
SL(3, R) has Kazhdan’s property (7). The argument relies on a decomposition 
theorem for representations of R”. 


Proof of Lemma 13.2.2. For convenience, let H = SL(2,R) x R?. Given 
a unitary representation (7,#) of H that has almost-invariant vectors, we 
wish to show that some nonzero vector in H is fixed by the subgroup R? 
of H. In other words, if we let E be the projection-valued measure provided 
by Proposition 11.5.2 (for the restriction of 7 to R*), then we wish to show 
E({0}) is nontrivial. 

Letting B(H) be the algebra of bounded linear operators on H, and using 
the fact that 7 has almost-invariant vectors, Exercise 1 provides a continuous, 
linear functional 4: B(H) — C, such that 


© \(Id) = 1, 
e \(£) > 0 for every orthogonal projection FE, and 


e \ is bi-invariant under H. (More precisely, for all hi,h2 € H and 
T € B(H) we have \(m(h1) T 7(h2)) = A(T).) 
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Now, let js be the composition of \ with E (that is, let w(A) = A(E(A)) for 
A CR”), so wis a finitely additive probability measure on R” (see Exercise 2). 
Since R? < H, there is an action of H on R? by conjugation. One can show 
that the probability measure ju is invariant under this action (see Exercise 3). 

On the other hand, the only SL(2, R)-invariant, finitely additive probabil- 
ity measure on R? is the point-mass supported at the origin (see Exercise 4). 
Therefore, we must have ({0}) = 1 4 0. Hence, E({0}) is nonzero, as 
desired. 


Exercises for §13.3. 


#1. Prove the existence of the linear functional 4: B(H) > C in the proof 
of Lemma 13.2.2. 
[Hint: For T € B(H), define An(T) = (Tun | un), where {un} is a sequence 
of unit vectors in H, such that ||7(h)vn — vn|| + 0 for every h € H. Let » be 
an accumulation point of {A,} in an appropriate weak topology.| 
#2. Let w be as defined near the end of the proof of Lemma 13.2.2. Show: 
a) p(R2) = 1. 
b) If A; and Ag are disjoint Borel subsets of R?, then we have (AU 
Ag) = #(A1) + p(A2). 
c) (A) > 0 for every Borel set A C R?. 


#3. Show the finitely additive measure yu in the proof of Lemma 13.2.2 is 
invariant under the action of H on R. 
[Hint: Since 


[ro dE(r") = | " (r) dE(r) = i t(h~'rh) dE(r) = 1(h-'rh) 


T 
R 


= m(h-*) ar) ah) = [ T(r) (1(h*) dE(rT) m(h)), 
R 
we have E(A") = 1(h~!) E(A) x(h) for A C R]] 

#4. Show that any SL(2,R)-invariant, finitely additive probability mea- 
sure i on R? is supported on {(0,0)}. 

[Hint: Let V = { (2, y) |y > |x|} andh= E il. Then h'V is disjoint from 
hiV fori #j€Z*, so u(V) =0. All of R? \ {(0,0)} is covered by finitely 
many sets of the form hV with h € SL(2,R).] 

#5. Show that the natural semidirect product SL(3, R) x R° has Kazhdan’s 
property. 

[Hint: We know SL(3, R) has Kazhdan’s property, and the proof of Lemma 13.2.2 
shows that the pair (SL(3,R) x R®,R*) has relative property (T).] 

#6. Show that the direct product SL(3,R) x R? does not have Kazhdan’s 
property. (Comparing this with Exercise 5 shows that, for group exten- 
sions, Kazhdan’s property may depend not only the groups involved, 
but also on the details of the particular extension.) 
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§13.4. Lattices in groups with Kazhdan’s property 


In this section, we will use basic properties of induced representations to prove 
the following important result: 


(13.4.1) Proposition. If G has Kazhdan’s property, then I has Kazhdan’s 
property. 


Combining this with Theorem 13.2.4, we obtain: 


(13.4.2) Corollary. If no simple factor of G is isogenous to SO(1,n) or 
SU(1,n), then T has Kazhdan’s property. 


By Proposition 13.1.7, this has two important consequences: 


(13.4.3) Corollary. If no simple factor of G is isogenous to SO(1,n) or 
SU(1,n), then 


1) T is finitely generated, and 
2) T/[L,T] ts finite. 


(13.4.4) Remarks. 


1) It was pointed out in Theorem 4.7.10 that (1) remains true without 
any assumption on the simple factors of G. In fact, I is always finitely 
presented, not merely finitely generated. 


2) On the other hand, (2) is not always true, because lattices in SO(1, 7) 
and SU(1,n) can have infinite abelian quotients. (In fact, it is conjec- 
tured that every lattice in SO(1,7) has a finite-index subgroup with an 
infinite abelian quotient, and this is known to be true when n = 3.) 
The good news is that the Margulis Normal Subgroup Theorem im- 
plies these are the only examples (modulo multiplying G by a compact 
factor) if we make the additional assumption that I is irreducible (see 
Exercise 16.143 or Exercise 17.1#1). 


The proof of Proposition 13.4.1 uses some machinery from the theory of 
unitary representations. 


(13.4.5) Notation. Let (7,#) and (0,) be unitary representations of a Lie 
group H. (In our applications, H will be either G or I.) 


1) We write o < 7 if a is (isomorphic to) a subrepresentation of 7. This 
means there exist 
e aclosed, H-invariant subspace H’ of H, and 
° a bijective, linear isometry T: K —> H’, 
such that T(o(h)¢) = 7(h) T(¢), for allh € H and g EK. 
2) We write o < 7 if o is weakly contained in 7. This means that, for 
every compact set C' in H, every € > 0, and all unit vectors ¢1,...,¢n € 
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K, there exist unit vectors ~1,...,Wn € H, such that, for all h € C and 
all 1 <i,7 <n, we have 
|(o(h) oi | 3) — (r(h) bi | d5)| <e. 
(13.4.6) Remarks. 
1) It is obvious that if o < 7, then o <7. 


2) We have: 
e 7 has invariant vectors if and only if 1 < 7, and 
e ~ has almost-invariant vectors if and only if 1 < 7. 
Therefore, Kazhdan’s property asserts the converse to (1) in the special 
case where o = 1: for all z, if 1 ~ 7, then 1 < 7. 


It is not difficult to show that induction preserves weak containment 
(see Exercise 1): 


(13.4.7) Lemma. If o < 7, then Ind€(c) ~ Ind&(r). 
This (easily) implies the main result of this section: 


Proof of Proposition 13.4.1. Suppose a representation 7 of I has almost- 
invariant vectors. Then 7 > 1, so 


Ind§(r) + Indf(1) = £°G/T) >1 
(see Exercises 2 and 11.3745). Because G has Kazhdan’s property, we conclude 
that Ind€(m) > 1. This implies 7 > 1 (see Exercise 3), as desired. 


(13.4.8) Remark. If [T has Kazhdan’s property, and S is any generating set 
of [, then there is some € > 0, such that every unitary representation of T° 
with an (¢,$)-invariant unit vector must have invariant vectors (see Exer- 
cise 13.1411). Our proof does not provide any estimate on ¢, but, in many 
cases, including [ = SL(n, Z), an explicit value of € can be obtained by work- 
ing directly with the algebraic structure of I (rather than using the fact that 
T is a lattice). 


(13.4.9) Remark. For many years, lattices (and some minor modifications of 
them) were the only discrete groups known to have Kazhdan’s property (7), 
but other constructions are now known. In particular: 


1) Groups can be defined by generators and relations. It can be shown 
that if the relations are selected at random (with respect to a certain 
probability distribution), then the resulting group has Kazhdan’s prop- 
erty ({) with high probability. 

2) An algebraic approach that directly proves Kazhdan’s property for 
SL(n, Z), without using the fact that it is a lattice, has been gener- 
alized to allow some other rings, such as polynomial rings, in the place 
of Z. In particular, SL(n, Z|X1,.-- , Xz]) has Kazhdan’s property (7) 
ifn >k+3. 
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(13.4.10) Remark. We saw in Proposition 12.7.22 that amenability is invariant 
under quasi-isometry (see Definition 10.1.3). In contrast, this is not true for 
Kazhdan’s property (T). To see this, let 


e G bea simple group with Kazhdan’s property (7), 

e G be the universal cover of G, 

e I’ be a cocompact lattice in G, and 

e I’ be the inverse image of [in G so T is a lattice in G. 


Then I has Kazhdan’s property (T’) (because G has the property). However, 
if G = Sp(4,R) (or, more generally, if the fundamental group of G is an 


infinite cyclic group), then T is quasi-isometric to [ x Z, which obviously does 
not have Kazhdan’s property (because its abelianization is infinite). 

Here is a brief explanation of why T is quasi-isometric to [ x Z. Note 
that T /Z@=T yields a 2-cocycle a: T x T > Z of group cohomology. Since 
G/T is compact, it turns out that a@ can be chosen to be uniformly bounded, 
as a function on [ x I. This implies that the extension T is quasi-isometric 
to the extension corresponding to the trivial cocycle. This extension is I’ x Z. 


Exercises for §13.4. 


#1. Prove Lemma 13.4.7. 
#2. Show 1 < £7(G/T). 


#3. Show that if 7 is a unitary representation of I, and 1 < Ind&(z), then 
1l<q. 


#4. Prove the converse of Proposition 13.4.1: Show that if [ has Kazhdan’s 
property, then G has Kazhdan’s property. 


[Hint: Any [T-invariant vector v can be averaged over G/T to obtain a G- 
invariant vector. If v is e-invariant for a compact set whose projection to G/T 
has measure > 1 — ¢, then the average is nonzero.| 


§13.5. Fixed points in Hilbert spaces 


We now describe an important geometric interpretation of Kazhdan’s prop- 
erty. 


(13.5.1) Definition. Let H be a Hilbert space. A bijection T: H — H is an 
affine isometry of H if there exist a unitary operator U on H, and b € H, 
such that 

T(v) =Uv+b for allv EH. 


(13.5.2) Example. Let wo be a nonzero vector in a Hilbert space H. For 
t € R, define an affine isometry ¢° of H by ¢'(v) = v + two; this yields an 
action of R on H by affine isometries. Since ¢1(v) = v + wo # v, we know 
that the action has no fixed point. 
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The main theorem of this section shows that the groups that do not 
have Kazhdan’s property are characterized by the existence of a fixed-point- 
free action as in Example 13.5.2. However, before stating the result, let us 
introduce some notation, so that we can also state it in cohomological terms. 


(13.5.3) Definition. Suppose (7,H) is a unitary representation of a Lie 
group H. Define 

1) C(H;H) = {continuous functions f: H > H}, 

2) Z(H; 1m) ={f € C(H;H) | Vg,h © H, f(gh) = f(g) + (9) f(A)}, 

3) B'(H;1) = {f € C(H;H) | du EH, VA CA, f(h) =v — r(h)v}, 

4) H'(H; 2) = 2'(H;7)/ B'(H; 7) (see Exercise 3). 


If the representation 7 on H is clear from the context, we may write Z \(H s FL): 
B\(H;H), and H'(H;H), instead of Z'(H;7), B\(H;7), and H'(H;7). 


(13.5.4) Theorem. For a Lie group H, the following are equivalent: 
1) H has Kazhdan’s property. 


2) For every Hilbert space H, every continuous action of H by affine isome- 
tries on H. has a fixed point. 


3) H'(H; 7) =0, for every unitary representation 7 of H. 
Proof of (2) => (3). Given f € Z'(H;7), define an action of f on H via 
affine isometries by defining 

hv =r(h)u+ f(h) forh € H andv eH 

(see Exercise 5). By assumption, this action must have a fixed point vo. 
For all h € H, we have vp = hug = 7(h)vo + f(h), so f(h) = vo — 7(h)v0. 
Therefore f € B'(H;7). Since f is an arbitrary element of Z'(H;7), this 
implies H'(H;7) = 0. 


Proof of (3) = > (1). We prove the contrapositive: assume H does not 
have Kazhdan’s property. This means a unitary representation of H on some 
Hilbert space H has almost-invariant vectors, but does not have invariant 
vectors. We claim H'(H;7~°) 4 0, where 7™ is the obvious diagonal action 
of H on the Hilbert space HH" =H@OHSG:--. 

Choose an increasing chain C; C C2 C--- of compact subsets of G, such 
that G = U,, Cn. For each n, since H has almost-invariant vectors, there 
exists a unit vector uv, € H, such that 


1 
|CUn — Un] < Te for allc € Ch. 
Now, define f: H > H™ by 
Gare — n(hvp oo Un) 


(see Exercise 9), so f € Z'(H; 7°) (see Exercise 10). However, it is easy to 
see that f is an unbounded function on H (see Exercise 11), so f ¢ B'(H;H™) 
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(see Exercise 4). Therefore f represents a nonzero cohomology class in 
HH; 1). O 


Alternate proof of (3) = > (1). Assume the unitary representation 7 
has no invariant vectors. (We wish to show this implies there are no almost- 
invariant vectors.) Define a linear map 

F:H— 2'(H;71) by F,(h) = x(h)u — v. 
Assume, for simplicity, that H is compactly generated (see Exercise 14), so 
some compact, symmetric set C' generates H. By enlarging C, we may assume 
C' has nonempty interior. Then the supremum norm on C' turns Z (A ;7) into 
a Banach space (see Exercise 12), and the map F is continuous in this topology 
(see Exercise 13(a)). 

Since there are no invariant vectors in H, we know that F is injective (see 
Exercise 13(b)). Also, the image of F is obviously B'(H;7). Since H'(H; 7) = 
0, this means that F’ is surjective. Therefore, F' is a bijection. So the Open 
Mapping Theorem (B7.6(2)) provides a constant «€ > 0, such that ||F,|| > € 
for every unit vector v. This means there is some h € C, such that ||7(h)v — 
v|| > €, so v is not (C,e€)-invariant. Therefore, there are no almost-invariant 
vectors. L 


Sketch of proof of (1) => (2). We postpone this proof to Section 13.6, 
where functions of positive type are introduced. They yield an embedding 
of H in the unit sphere of a (larger) Hilbert space H. This embedding is 
nonlinear and non-isometric, but there is a unitary representation 7 on H for 
which the embedding is equivariant. Kazhdan’s property provides an invariant 
vector in H, and this pulls back to a fixed point in H. See Section 13.6 for 
more details. L] 


(13.5.5) Remark. If H satisfies (2) of Theorem 13.5.4, it is said to have “prop- 
erty (F'H)” (because it has Fixed points on Hilbert spaces). 


In Definition 13.5.3, the subspace B'(H;7) may fail to be closed (see Ex- 
ercise 16). In this case, the quotient space H'(H;m) does not have a good 
topology. Fortunately, it can be shown that Theorem 13.5.4 remains valid 
even if we replace B'(H; 7) with its closure: 


(13.5.6) Definition. In the notation of Definition 13.5.3, let: 


1) B'(H;7) be the closure of B'(H;7) in Z'(H;7), and 
2) H (H: mt) = Z(H; 7)/B'(H; 1). This is called the reduced 1st cohomol- 
ogy. 


The following result requires the technical condition that H is compactly 
generated (see Definition 13.1.10 and Exercise 17). 
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(13.5.7) Theorem. A compactly generated Lie group H has Kazhdan’s prop- 
erty if and only if H (H; mw) =0, for every unitary representation x of H. 


Because reduced cohomology behaves well with respect to the direct in- 
tegral decomposition of a unitary representation (although the unreduced 
cohomology does not), this theorem implies that it suffices to consider only 
the irreducible representations of H: 


(13.5.8) Corollary. A compactly generated Lie group H has Kazhdan’s prop- 
erty if and only if H(H;7) = 0, for every irreducible unitary representa- 
tion w of H. 


(13.5.9) Remark. We have seen that a group with Kazhdan’s property has 
bounded orbits whenever it acts isometrically on a Hilbert space. The same 
conclusion has been proved for isometric actions on some other spaces, in- 
cluding real hyperbolic n-space 9”, complex hyperbolic n-space 9%, and all 
“median spaces” (including all R-trees). (In many cases, the existence of a 
bounded orbit implies the existence of a fixed point.) See Exercise 19 for an 
example. 


Exercises for §13.5. 


#1. Let T: H > H. Show that if T is an affine isometry, then 
a) T(vu—w) =T(v) —T(w) +T(0), and 
b) [[Z@) — TF) = lle - wl), 
for all v,w € H. 


#2. Prove the converse of Exercise 1. 


#3. In the notation of Definition 13.5.3, show that B'(H;7) C 2'(H;7) (so 
the quotient Z'(H;7)/B'(H;7) is defined). 


#4. Suppose f € B'(H;7) so f: H > H. Show f is bounded. 


#5. Suppose 
e (z,H) is a unitary representation of H, and 
et: HOH. 
For h € A and v € H, let a(h)v = m(h)v + T(h), so a(h) is an affine 
isometry of H. Show that a defines a continuous action of H on H if 
and only if 7 € Z'(H;7) and 7 is continuous. 


#6. Suppose H acts continuously by affine isometries on the Hilbert space H. 
Show there is a unitary representation 7 of H on H, and some T € 
2'(H;7), such that hv = 1(h) v + 7(h) for every h € H and v EH. 


#7. Suppose H acts continuously by affine isometries on the Hilbert space H. 
Show the following are equivalent: 
a) H has a fixed point in H. 
b) The orbit Hv of each vector v in H is a bounded subset of H. 
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#8. 


#9. 


4£10. 
#11. 


#12. 


#13. 


#614, 


#15. 
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c) The orbit Hv of some vector v in H is a bounded subset of H. 


[Hint: You may use (without proof) the fact that every nonempty, bounded 
subset X of a Hilbert space has a unique circumcenter. By definition, the 
circumcenter is a point c, such that, for some r > 0, the set X is contained 
in the closed ball of radius r centered at c, but X is not contained in any ball 
of radius < r (centered at any point). 


Prove directly that 13.5.4(3) = > 13.5.4(2), without using Kazhdan’s 
property. 

[Hint: For each h € H, there is a unique unitary operator 7(h), such that we 
have hv = m(h)v + h(0) for all v CH. Fix v € H and define f € Z(H; 7) by 
f(h) = hv —v. If f € B\(A; 7), then H has a fixed point.] 


In the notation of the proof of 13.5.4(3 > 1), show f: H > H®. 
[Hint: For each h € H, show the sequence {|| f(h)n||} is square-summable.] 


In the notation of the proof of 13.5.4(3 > 1), show f € H'(H; 7°). 


In the notation of the proof of 13.5.4(3 > 1), show f is unbounded. 


[Hint: You may use (without proof) the fact that every nonempty, bounded 
subset of a Hilbert space has a unique circumcenter, as in Exercise 7.| 


Assume C' is a compact, symmetric set that generates H, and has 
nonempty interior. For each f € Z'(H;H), let €(f) be the restric- 
tion of f to C. Show that € is a bijection from Z'(H;H) onto a closed 
subspace of the Banach space of continuous functions from C' to H. 


In the notation of the alternate proof of 13.5.4(3 > 1), show: 
a) F is continuous. 
b) F is injective. 

[Hint: (b) If F, = Fy, then what is r(h)(v — w)?] 


Remove the assumption that H is compactly generated from the alter- 
nate proof of (3) ==> (1). 


[Hint: The topology of uniform convergence on compact sets makes Z2'(H; 7) 
into a Frchet space.| 


Assume 

T’ has Kazhdan’s property T, 

V is a vector space, 

H is a Hilbert space that is contained in V, 

v € V, and 

a: + GL(V) is any homomorphism, such that 
o the restriction o(y)|z is unitary, for every y € T, and 
o H+ v is o(L)-invariant. 

Show o(L) has a fixed point in H + v. 

[Hint: Theorem 13.5.4(2).] 


4£16. 


#17. 


4£18. 


4£19. 


420. 


#21. 
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Show that if 7 has almost-invariant vectors, then B'(H;7) is not closed 
in 2'(H;7). 

[Hint: See the alternate proof of Theorem 13.5.4(3 => 1).] 

Show the assumption that H is compactly generated cannot be removed 
from the statement of Theorem 13.5.7. 

[Hint: Let H be an infinite, discrete group, such that every finitely generated 
subgroup of H is finite.] 

Show H has Kazhdan’s property if and only if H'(H;7) = 0, for every 
irreducible unitary representation 7 of H. 

[Hint: You may assume Theorem 13.5.7.| 


Definition. A tree is a contractible, 1-dimensional simplicial complex. 


(Watatani) Suppose 
e A is a discrete group that has Kazhdan’s property, and 
e acts by isometries on a tree T’. 
Show A has a fixed point in T (without assuming Remark 13.5.9). 


[Hint: Fix an orientation of T, and fix a vertex v in T. For each 4 € A, 
the geodesic path in T from v to A(v) can be represented by a {0, +1}-valued 
function P, on the set E of edges of T. Verify that A +> Py is in BailiNs L(E)), 
and conclude that the orbit of v is bounded.] 


Show SO(1,7) and SU(1,n) do not have Kazhdan’s property. 
[Hint: You may assume the facts stated in Remark 13.5.9.] 


It is straightforward to verify that all of the results in this chapter 
remain valid if we require H to be a real Hilbert space (instead of 
a Hilbert space over C), as in Assumption 13.6.1 below. In this set- 
ting, there is no need to restrict attention to affine isometries in the 
statement of Theorem 13.5.4(2), because all isometries are affine: 

Let H be a real Hilbert space, and let y: H — H be any distance- 
preserving bijection (so ||y(v) — y(w)|| = ||ju — w|| for all v,w € H). 
Show that vy is an affine isometry. 

[Hint: The main problem is to show that if y(0) = 0, then vy is R-linear. This 


is well known (and easy to prove) when H = R?. The general case follows 
from this.] 


§13.6. Functions on H that are of positive type 


This section completes the proof of Theorem 13.5.4, by showing that affine iso- 
metric actions of Kazhdan groups on Hilbert spaces always have fixed points. 
For this purpose, we develop some of the basic theory of functions of positive 


type. 


(13.6.1) Assumption. To simplify some details, Hilbert spaces in this section 


are assumed to be real, rather than complex. (That is, the field of scalars is R, 


rather than C.) 
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(13.6.2) Definition. 
1) Let A be an n x n real symmetric matrix. 
(a) A is of positive type if (Av | v) > 0 for all v € R”. Equivalently, 
this means all of the eigenvalues of A are > 0 (see Exercise 1). 
(b) A is conditionally of positive type if 
(i) (Av | v) > 0 for all v = (v1,..., Un) € R”, such that we have 
Vy +-+++Un = 0, and 
(ii) all the diagonal entries of A are 0. 
(The word “conditionally” refers to the fact that the inequality on 
(Av | v) is only required to be satisfied when a particular condition 
is satisfied, namely, when the sum of the coordinates of v is 0.) 


2) A continuous, real-valued function y on a topological group H is said 
to be of positive type (or conditionally of positive type, respec- 
tively) if, for all n and all hi,...,hn € H, the matrix (y(h; *h;)) is a 
symmetric matrix of the said type. 

(13.6.3) Warning. A function that is of positive type is almost never condi- 
tionally of positive type. This is because a matrix satisfying (la) of Defini- 
tion 13.6.2 will almost never satisfy (1(b)ii) (see Exercise 2). 


(13.6.4) Other terminology. Functions of positive type are often called 
positive definite or positive semi-definite. 


Such functions arise naturally from actions of H on Hilbert spaces: 
(13.6.5) Lemma. Suppose 

e H is a topological group, 

e H acts continuously by affine isometries on a Hilbert space H, 

eveH, 


e vo: H SR is defined by y(h) = —||hv — v||? for h € H, and 
ew: H > R is defined by w(h) = (hu | v) forh Ee H. 
Then: 


1) y is conditionally of positive type, and 
2) w is of positive type if h(0) =0 for allh € H. 


Proof. Exercises 6 and 7. L] 
Conversely, the following result shows that all functions of positive type 


arise from this construction. (The “GNS” in its name stands for Gelfand, 
Naimark, and Segal.) 


(13.6.6) Proposition (“GNS construction”). If f: H > R is of positive type, 
then there exist 


e a continuous action of H by linear isometries on a Hilbert space H (so 
h(0O) =0 for all h), and 
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eveH, 
such that f(h) = (hv |v) for allh € H. 


Proof. Let R[H] be the vector space of functions on H with finite support. 
Since the set of delta functions { 6, | h € H} is a basis, there is a unique 
bilinear form on R[H], such that 


(One: Ope = f(hg he) for all hy, ho. 


Since f is of positive type, this form is symmetric and satisfies the inequality 

(w | w) > 0 for all w. Let Z be the radical of the form, which means 
Z={2€ RA] | (z| z) = 0}, 

so (|) factors through to a well-defined positive-definite, symmetric bilinear 

form on the quotient R[H]/Z. This makes the quotient into a pre-Hilbert 

space; let H be its completion, which is a Hilbert space, and let v be the 

image of 6, in H. 

The group H acts by translation on R[H], and it is easy to verify that the 
action is continuous, and preserves the bilinear form (see Exercise 8). There- 
fore, the action extends to a unitary representation of H on H. Furthermore, 
for any h € H, we have 


f(h) = fle-h) = (e | bn) = We | hdc) = (v | hv) = (hv | 0), 


as desired. 


We will also use the following important relationship between the two 
concepts: 


(13.6.7) Lemma (Schoenberg’s Lemma). If y is conditionally of positive type, 
then e? is of positive type. 


Proof. A function «: H x H — R is said to be a kernel of positive type 
if the matrix («(hi, h;)) is a symmetric matrix of positive type, for all n and 
all hy,..., hn € A. 

Define k: H x H + R by 


K(9,h) = p(g-*h) — 9(g) — 9h). 
Then: 
e « is a kernel of positive type (see Exercise 9), 
e so e” is a kernel of positive type (see Exercise 11), 
e and e?e% ") is a kernel of positive type (see Exercise 12), 


e so the product e%(9-”) (cP evlh)) is a kernel of positive type (see Exer- 
cise 10). 


This product is er(g th). so e® is a function of positive type. 
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With these tools, it is not difficult to show that affine isometric actions 
of Kazhdan groups on Hilbert spaces always have fixed points: 


Proof of Theorem 13.5.4 (1 = 2). Let a be the given action of H on H 
by affine isometries, and let 7 be the corresponding unitary representation 
(see Exercise 13.546). Therefore, we have 
a(h)v =7(h)vu+7(h) forh € A andveH, 

where 7 € Z(H; 7). 

Let H =H ™x H be the semidirect product of (the additive group of) H 
with H, where H acts on H via 7. This means the elements of H are the 
ordered pairs (v,h), and, for v1, v2 € H and hy, he € H, we have 


(v1, h1) + (v2, he) = (v1 + (hi) v2, hihg). 
This semidirect product is a topological group, so we can apply the above 
theory of functions of positive type to it. Define a continuous action @ of H 
on H by 
a(v,h)w=a(h)w+u (13.6.8) 
(see Exercise 3), and define 
@: H + R by Gv, h) = —|@(v, h)(0)|?. 
Since @(v, h) is an affine isometry for every v and h, we know ¢ is condition- 
ally of positive type (see Lemma 13.6.5(1)). Therefore e” is of positive type 
(see Lemma 13.6.7). Hence, the GNS construction (13.6.6) provides a unitary 
representation 7 of H on a Hilbert space H and some 6 € H, such that 
(#(v, h)6 | 6) =e?) for allu EH andh € H. (13.6.9) 
We now define 
®:H—-H by ®(v) = 77, e)d. 
We have 
®(a(h)v) = 7(0,h) (v) for h € H andveH (13.6.10) 
(see Exercise 14), so ® converts the affine action of H on H to a linear action 
on H. Since the linear span of ®(H) contains 6 and is invariant under a (H ) 


(see Exercise 4), there is no harm in assuming that its closure is all of H. 
It is clear from the definition of ¢ that ¢(0,e) = 0, so we know that é is 
a unit vector. Therefore 


\|7(v, h)o — 6||? = (#(v, h)t —6, Tv, h)o — o) 
= 2(1— (#(v,h)6 | 6)) (13.6.11) 
= 2(1— e%™")), 
Since H has Kazhdan’s property, there is a compact subset C of H and 
some € > 0, such that every unitary representation of H that has a (C,€)- 


invariant vector must have an invariant vector. There is no harm in multi- 
plying the norm on H by a small positive scalar, so we may assume ((0, h) 
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is as close to 0 as we like, for all h € C. Then (13.6.11) tells us that 6 is 
(C, €)-invariant, so H must have a nonzero H-invariant vector Bo. 

Suppose the affine action a does not have any fixed points. (This will 
lead to a contradiction.) This implies that every H-orbit on H is unbounded 
(see Exercise 13.57#¢7). Hence, for any fixed v € H, and all h € H, we have 


(B(v) | do) = (®(v) | F(0, A+) Go) (09 is H-invariant) 
= (7(0, h) ®(v) | do) (7 is unitary) 
= (®(a(h)v) | 60) (13.6.10) 
+0 as |la(h)v|| > co (Exercise 5). 


So to is orthogonal to the linear span of ®(H), which is dense in H. Therefore 
0p = 0. This is a contradiction. 


Exercises for §13.6. 


#1. Let A be a real symmetric matrix. Show A is of positive type if and 
only if all of the eigenvalues of A are > 0. 
[Hint: A is diagonalizable by an orthogonal matrix.] 


#2. Suppose A is an n x n real symmetric matrix that is of positive type 
and is also conditionally of positive type. Show A = 0. 
[Hint: What does 13.6.2(1(b)ii) say about the trace of A?] 


#3. In the notation of the proof of Theorem 13.5.4, show 
a(vi, hy) : A(ve, hz) = Q((v1, hy) : (ve, hz)) 
for all 01,02 €H and hy, he € H. 


#4. In the notation of the proof of Theorem 13.5.4, show that the linear 
span of ®(H) is invariant under 7(H). 

[Hint: See (13.6.10).] 

#5. In the notation of the proof of Theorem 13.5.4, show that if v € H, 
and {w,} is a sequence in H, such that ||w,|| > co, then ®(w,) > 0 
weakly. 

[Hint: If @ € ®(H), then (13.6.9) implies (®(w,) | @) > 0.] 

#6. Prove Lemma 13.6.5(1). 


[Hint: If 2, ti = 0, then Do, , tity p(hz thyv) = 2 ||, tehiol]” 


#7. Prove Lemma 13.6.5(2). 
| Hint: ye tj tj w(h; 'hjv) = be tyhyv 


2 
I" 
#8. Prove that the action of H acts on R[H] by translation is continu- 


ous, and preserves the bilinear form defined in the proof of Proposi- 
tion 13.6.6. 
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#9. Show that the kernel « in the proof of Lemma 13.6.7 is of positive type. 
[Hint: For v1,...,Un € R and hi,...,hn € H, let vo = — >>; and ho = e. 
Then 57, , vivj(hi,hj) = 0 since ~ is conditionally of positive type and 
Yo ui = 0. However, the terms with either 1 = 0 or 7 = O have no net 
contribution, since y(e) = 0.] 


#10. Show that if « and X are kernels of positive type, then «A is a kernel of 
positive type. 
[Hint: Given hi,...,hn € H, show there is a matrix L, such that L? = 
(A(hi, h;)). Note that , Ui U5 K(ha, h;) A(hi, h;) = No. are (Lieve) (L439) K(hi, h;) > 
0.] 


#11. If & is a kernel of positive type, show e” is a kernel of positive type. 
[Hint: Since «” is a kernel of positive type for all n, the same is true of 


YK" /n!.] 
#12. For every y: H > R, show ¢y(g) y(h) is a kernel of positive type. 


#13. Suppose y: H > R. Prove the following converse of Lemma 13.6.7: If 
e'? is of positive type for all t > 0, then gy is conditionally of positive 
type. 


[Hint: Verify that e’* —1 is conditionally of positive type. Then lim,_,9+(e"” — 
1)/t has the same type.| 


#14. Verify (13.6.10). 


[Hint: Since @(—r(h), h)(0) = 0, we have (7(e, h)é | 7(7(h), e)6) = 1. Since 
they are of norm 1, the two vectors must be equal.] 


#15. Recall that a Lie group H has the Haagerup property if it has a 
unitary representation, such that there are almost-invariant vectors, 
and all matrix coefficients decay to 0 at oo. It is known that H has the 
Haagerup property if and only if it has a continuous, proper action by 
affine isometries on some Hilbert space. Prove the implication (<=) of 
this equivalence. 


Notes 


The monograph [4] is the standard reference on Kazhdan’s property (T). 
The property was defined by D. Kazhdan in [11], where Propositions 13.1.7 
and 13.4.1 and Theorem 13.2.1 were proved. 

See [2] for a discussion of the generalization of Kazhdan’s property to 
actions on Banach spaces, including Warning 13.1.3 and Exercise 13.141. 
See [6] for a discussion of the Haagerup property that is mentioned in Ex- 
ercises 13.1412 and 13.6#15. See [18] for much more information about 
expander graphs and their connection with Kazhdan’s property, mentioned 
in Exercise 13.1416. 
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Regarding Warning 13.1.11: 


e By showing that SL(3, F,[t]) is not finitely presentable, H. Behr [3] 
provided the first example of a group with Kazhdan’s property that is 
not finitely presentable. 


e The existence of uncountably many Kazhdan groups was proved by 
M. Gromoy [9, Cor. 5.5.E, p. 150]. More precisely, any cocompact lattice 
in Sp(1,n) has uncountably many different quotients (because it is a 
“hyperbolic” group), and all of these quotient groups have Kazhdan’s 


property. 


e Y.Shalom [20, p. 5] proved that every discrete Kazhdan group is a 
quotient of a finitely presented Kazhdan group. The proof can also be 
found in [4, Thm. 3.4.5, p. 187]. 


Our proof of Theorem 13.2.1 is taken from [4, §1.4]. 

Theorem 13.2.4 appears in [4, Thm. 3.5.4, p. 177]. It combines work of 
several people, including Kazhdan [11] and Kostant [14, 15]. See [4, pp. 5-7] 
for an overview of the various contributions to this theorem. 

A detailed solution of Exercise 13.3#1 can be found in [4, Lem. 1.4.1]. 

See [4, Thm. 1.7.1, p. 60] for a proof of Proposition 13.4.1 and its converse 
(Exercise 13.4#4). 

Regarding Remark 13.4.4(2), see [17] (and its references) for a discussion 
of W. Thurston’s conjecture that lattices in SO(1,n) have finite-index sub- 
groups with infinite abelian quotients. (For n = 3, the conjecture was proved 
by Agol [1].) Lattices in SU(1,) with an infinite abelian quotient were found 
by D. Kazhdan [12]. 

Explicit Kazhdan constants for SL(n,Z) (cf. Remark 13.4.8) were first 
found by M. Burger |7, Appendix] (or see [4, §4.2]). An approach developed 
by Y. Shalom (see [21]) applies to more general groups (such as SL(n, Z(z})) 
that are not assumed to be lattices. 

Remark 13.4.9(1) is a theorem of Zuk [23, Thm. 4]. (Or see [16] for a 
more detailed proof.) Remark 13.4.9(2) is explained in [21]. 

Remark 13.4.10 is due to S.Gersten (unpublished). A proof (based on 
the same example, but rather different from our sketch) is in [4, Thm. 3.6.5, 
p. 189]. 

Theorem 13.5.4 is due to Delorme [8, Thm. V.1] (for (1 = 2)) and 
Guichardet [10, Thm. 1] (for (8 > 1)). 

Theorem 13.5.7 was proved for discrete groups by Korevaar and Schoen 
[13, Cor. 4.1.3]. The general case is due to Shalom [20, Thm. 6.1]. Corol- 
lary 13.5.8 and Exercise 13.5#18 are also due to Shalom [20, proof of 
Thm. 0.2]. 

The part of Remark 13.5.9 dealing with real or complex hyperbolic spaces 
is in [4, Cor. 2.7.3]. See [5] for median spaces. 
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The existence and uniqueness of the circumcenter (mentioned in the hints 
to Exercises 13.547 and 13.5#11) is proved in [4, Lem. 2.2.7]. 

Exercise 13.5719 is due to Watatani [22], and can also be found in [4, 
§2.3]. Serre’s book [19] is a very nice exposition of the theory of group actions 
on trees, but, unfortunately, does not include this theorem. 

See [4, §2.10-§2.12 and §C.4] for the material of Section 13.6. 
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Chapter 14 


Ergodic Theory 


Ergodic Theory is the study of measure-theoretic aspects of group actions. 
Topologists and geometers may be more comfortable in the category of con- 
tinuous functions, but important results in Chapters 16 and 17 will be proved 
by using measurable properties of actions of I, so we will introduce some of 
the basic ideas. 


§14.1. Terminology 
The reader is invited to skim through this section, and refer back as necessary. 


(14.1.1) Assumption. 


1) All measures are assumed to be o-finite. That is, if w is a measure 
on a measure space X, then we always assume that X is the union of 
countably many subsets of finite measure. 


2) We have no need for abstract measure spaces, so all measures are as- 
sumed to be Borel. That is, when we say p is a measure on a measure 
space X, we are assuming that X is a Borel subset of a complete, sepa- 
rable, metrizable space, and the implied o-algebra on X consists of the 
subsets of X that are equal to a Borel set, modulo a set of measure 0. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [ is a lattice in the semisimple Lie group G C SL(¢,R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: none. 
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(14.1.2) Definitions. Let be a measure on a measure space X. 
1) We say pu is a finite measure if 1u(X) < co. 
2) A subset A of X is: 
e null if u(A) =0, 
e conull if the complement of A is null. 


3) We often abbreviate “almost everywhere” to “a.e” 


4) Essentially is a synonym for “almost everywhere”? For example, a 
function f is essentially constant iff f is constant (a.e.). 


5) Two measures p and v on X are in the same measure class if they 
have exactly the same null sets: 


p(A) =0 — > v(A) =0. 
(This defines an equivalence relation.) Note that if v = fu, for some 
real-valued, measurable function f, such that f(x) 4 0 for a.e. x € X, 


then y and v are in the same measure class (see Exercise 1). The 
Radon-Nikodym Theorem (B6.13) implies that the converse is true. 


(14.1.3) Definitions. Suppose H is a Lie group H that acts continuously on 
a metrizable space X, jz is a measure on X, and A is a subset of X. 


1) The set A is invariant (or, more precisely, H-invariant) if hA = A 
for allh € H. 

2) The measure yw is invariant (or, more precisely, H-invariant) if 
h,j = p for all h € H. (Recall that the push-forward h, is defined in 
(B6.7).) 

3) The measure ps is quasi-invariant if h,j is in the same measure class 
as , for allh € H. 


4) A (measurable) function f on X is essentially H-invariant if, for 
every h € H, we have 


f(hx) = f(x) for ae. x € X. 


The Lebesgue measure on a manifold is not unique, but it determines a 
well-defined measure class, which is invariant under any smooth action: 


(14.1.4) Lemma (see Exercise 4). If X is a manifold, and H acts on X 
by diffeomorphisms, then Lebesgue measure provides a measure on X that is 
quasi-invariant for H. 


Exercises for §14.1. 


#1. Suppose yz is a measure on a measure space X, and f is a real-valued, 
measurable function on X, such that f > 0 for a.e. x. Show that fy is 
in the measure class of py iff f(a) 4 0 for ae. rE X. 
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#2. Suppose a Lie group H acts continuously on a metrizable space X, and 
j1 is a measure on X. Show that yu is quasi-invariant iff the collection of 
null sets is H-invariant. (This means that if A is a null set, andh € H, 
then h(A) is a null set.) 


#3. Suppose 
e A isa null set in R” (with respect to Lebesgue measure), and 
e f is a diffeomorphism of some open subset O of R”. 
Show that f(ANQ) is a null set. 


[Hint: Change of variables.| 


#4. Suppose X is a (second countable) smooth, n-dimensional manifold. 
This means that X can be covered by coordinate patches (Xj, y;) 
(where y;: X; — R”, and the overlap maps are smooth). 


a) Show there exists a partition X = U*, X; into measurable sub- 
sets, such that O¢ C X; for each 7. 

b) Define a measure x on X by p(X) = A(vi(AN Xi)), where is 
the Lebesgue measure on R”. This measure may depend on the 
choice of X;, y;, and Xx: but show that the measure class of pu is 
independent of these choices. 

[Hint: Exercise 3.] 


§14.2. Ergodicity 


Suppose H acts on a topological space X. If H has a dense orbit on X, then it 
is easy to see that every continuous, H-invariant function is constant (see Ex- 
ercise 2). Ergodicity is the much stronger condition that every measurable 
H-invariant function is constant (a.e.): 


(14.2.1) Definition. Suppose A acts on X with a quasi-invariant measure ju. 
We say the action of H is ergodic (or that ys is an ergodic measure for H) 
if every H-invariant, real valued, measurable function on X is essentially 
constant. 


It is easy to see that transitive actions are ergodic (see Exercise 3). But 
non-transitive actions can also be ergodic: 


(14.2.2) Example (Irrational rotation of the circle). For any a € R, we may 
define a homeomorphism Ty, of the circle T = R/Z by 

T,,(2) = x +a (mod Z). 
By considering Fourier series, it is not difficult to show that if a is irrational, 
then every Ty-invariant function in £7(T) is essentially constant (see Exer- 
cise 6). This implies that the Z-action generated by Ty is ergodic (see Exer- 
cise 7). 


Example 14.2.2 is a special case of the following general result: 
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(14.2.3) Proposition. If H is any dense subgroup of a Lie group L, then the 
natural action of H on L by left translation is ergodic (with respect to the 
Haar measure on L). 


Proof. For any measurable f: L — R, its essential stabilizer in L is defined 
to be: 

Stabr(f) ={geEL|f(gx)=f(x) forae. cE L}. 
It is not difficult to show that Stabz(f) is closed (see Exercise 1). (It is also 
a subgroup of L, but we do not need this fact.) Hence, if Stab, (f) contains 
a dense subgroup H, then it must be all of L. This implies that f is constant 
(a.e.) (see Exercise 4). O 


It was mentioned above that transitive actions are ergodic; therefore, G 
is ergodic on G/T. What is not obvious, and leads to important applications 
for arithmetic groups, is that most subgroups of G are also ergodic on G/T: 


(14.2.4) Theorem (Moore Ergodicity Theorem, see Exercise 11.2#11). If 
e H is any noncompact, closed subgroup of G, and 


e I ts irreducible, 
then H is ergodic on G/T. 


If H is ergodic on G/T, then [ is ergodic on G/H (see Exercise 12). 
Hence: 


(14.2.5) Corollary. If H and T are as in Theorem 14.2.4, then I’ is ergodic 
on G/H. 


Exercises for §14.2. 


#1. Show Stab,(f) is closed, for every Lie group L and measurable f: L > 
R. 
[Hint: If f is bounded, then, for any y € C-(L) and {gn} C Stabi(f), we have 
Ff; 7 -1 g 
Si of edu =f, f% edu=lim J, f-% edu=lim J, "f-edu= fi f-pdp] 
#2. Suppose H acts on a topological space X, and has a dense orbit. Show 
that every real-valued, continuous, H-invariant function on X is con- 
stant. 


#3. Show that H is ergodic on H/ Hj, for every closed subgroup Hy; of H. 
(Hint: Every H-invariant function is constant, not merely essentially con- 
stant.| 

#4. Suppose H is ergodic on X, and f: X — Ris measurable and essentially 
H-invariant. Show that f is essentially constant. 


#5. Our definition of ergodicity is not the usual one, but it is equivalent: 
show that H is ergodic on X iff every H-invariant measurable subset 
of X is either null or conull. 
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[Hint: The characteristic function of an invariant set is an invariant function. 
Conversely, the sub-level sets of an invariant function are invariant sets.] 
#6. In the notation of Example 14.2.2, show (without using Proposition 14.2.3): 
a) If q is irrational, then every T,-invariant function in £°(T) is es- 
sentially constant. 
b) If q@ is rational, then there exist T-invariant functions in £7(T) 
that are not essentially constant. 
[Hint: Any f € £°(T) can be written as a unique Fourier series: f = 
ee ane’. If f is invariant and a is irrational, then uniqueness im- 
plies an = 0 for n £ 0. 
#7. Suppose ys is an H-invariant, finite measure on X. For all p € [1, co], 
show that H is ergodic iff every H-invariant element of £°(X, 1) is 
essentially constant. 


#8. Let H = Zact on X = R by translation, and let w be Lebesgue measure. 
Show: 
a) H is not ergodic on X, and 
b) for every p € [1, 00), every H-invariant element of £?(X, 4) is es- 
sentially constant. 
Why is this not a counterexample to Exercise 7? 


#9. Let H be a dense subgroup of L. Show that if L is ergodic on X, then 
HT is also ergodic on X. 
[Hint: Exercise 1.] 


#10. Show that if H acts continuously on X, and w is a quasi-invariant 
measure on X, then the support of 4 is an H-invariant subset of X. 


#11. Ergodicity implies that a.e. orbit is dense in the support of uw. More 
precisely, show that if H is ergodic on X, and the support of w is all 
of X (in other words, no open subset of X has measure 0), then a.e. 
H-orbit in X is dense. (That is, for a.e. x € X, the orbit Hx of x is 
dense in X. 
[Hint: The characteristic function of the closure of any orbit is invariant.| 


#12. Suppose H is a closed subgroup of G. Show that H is ergodic on G/T 
iff [ is ergodic on G/H. 
[Hint: H xT acts on G (by letting H act on the left and [ act on the right). 
Show H is ergodic on G/T iff H xT is ergodic on G iff T is ergodic on G/H.] 
#13. The Moore Ergodicity Theorem has a converse: Assume G is not com- 
pact, and show that if H is any compact subgroup of G, then H is not 
ergodic on G/T. 


[Hint: T acts properly discontinuously on G'/H, so the orbits are not dense.] 


#14. Show that ifn > 2, then 
a) the natural action of SL(n, Z) on R” is ergodic, and 
b) the SL(n, Z)-orbit of a.e. vector in R” is dense in R”. 
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[Hint: Identify R” with a homogeneous space of G = SL(n,R) (a.e.), by 
noting that G is transitive on the nonzero vectors of R”.] 
#15. Let 
e G=SL(3,R), 
e I be a lattice in G, and 
* 
e P= * * | CG. 
Show: 
a) The natural action of [ on the homogeneous space G/P is ergodic. 
b) The diagonal action of [ on (G/P)? = (G/P) x (G/P) is ergodic. 
c) The diagonal action of [ on (G/P)? = (G/P) x (G/P) x (G/P) is 
not ergodic. 


[Hint: G is transitive on a conull subset of (G/P)*, for k < 3. What is the 
stabilizer of a generic point in each of these spaces?| 


#16. Assume I is irreducible, and let H be a closed, noncompact subgroup 
of G. Show, for a.e. « € G/T, that Hz is dense in G/T. 


#17. Suppose H acts ergodically on X, with invariant measure jp. Show that 
if u(X) < oo and H = R, then some cyclic subgroup of H is ergodic 
on X. 
[Hint: For each t € R, choose a nonzero, h'-invariant function f, € L?(X), 
such that f; | 1. The projection of f,. to the space of h*-invariant functions is 
invariant under both h” and h*. Therefore, if r and s are linearly independent 
over Q, then f, 1 fs. This is impossible, because £7(X, 1) is separable.] 


§14.3. Consequences of a finite invariant measure 


Measure-theoretic techniques are especially powerful when the action has an 
invariant measure that is finite. One example of this is the Poincaré Recur- 
rence Theorem (4.6.1). Here is another. 

We know that almost every orbit of an ergodic action is dense (see Ex- 
ercise 14.2#11). For the case of a Z-action with a finite, invariant measure, 
the orbits are not only dense, but uniformly distributed: 

(14.3.1) Definition. Let 
e uw be a finite measure on a topological space X, and 
e T’ be a homeomorphism of X. 


The (T)-orbit of a point x in X is uniformly distributed with respect to wu 
if 


lL 1 
lim > F(TN(e)) = fF 
bin 2S 70040) = ahs f 
for every bounded, continuous function f on X. 


(14.3.2) Theorem (Pointwise Ergodic Theorem). Suppose 
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e yu is a finite measure on a second countable, metrizable space X, and 


e T is an ergodic, measure-preserving homeomorphism of X. 


Then a.e. (T)-orbit in X is uniformly distributed (with respect to pL). 


It is tricky to show that limy+. 4 peer (T i (x)) converges pointwise 
(see Exercise 5). Convergence in norm is much easier (see Exercise 3). 
(14.3.3) Remark. Although the Pointwise Ergodic Theorem was stated only 
for actions of a cyclic group, it generalizes very nicely to the ergodic actions 
of any amenable group. (The values of f are averaged over an appropriate 


Felner set in the amenable group.) See Exercise 8 for actions of R. 


Exercises for §14.3. 


#1. 


#2. 


#3. 


4A, 


Suppose the (7)-orbit of x is uniformly distributed with respect to a 
finite measure on X. Show that if the support of wp is all of X, then 
the (T)-orbit of x is dense in X. 


Suppose 
e U isa unitary operator on a Hilbert space H, 
ev EH, and 
e (v|w) =0, for every vector w that is fixed by U. 
Show + )07_, U*u 30 as n> ov. 
[Hint: Apply the Spectral Theorem to diagonalize the unitary operator U.| 


(Mean Ergodic Theorem) Assume the setting of the Pointwise Ergodic 
Theorem (14.3.2). Show that if f € £?(X, 1), then 


1g i 
—S° f(T* (a +o f fan inf 
Tat ( ( ) WX) Sx 

That is, show 


= 0. 
2 


lim 
nCoO 


lL 1 

= F(EM@)) — ay ff 
nf) — Fon J, 

Do not assume Theorem 14.8.2. 


[Hint: Exercise 2.] 


Assume X, 4, and T are as in Theorem 14.3.2, and that u(X) = 1. For 
f € L'(X, p), define 


Sn(x) = f(x) + f(T(x)) +---+ f(T" "(2)). 


Prove the Maximal Ergodic Theorem: for every a € R, if we let 
Sn(z) } 
sup >a?, 
n 


n 


Ea={aeXx 


then f,, fdu > ap(£). 

[Hint: Assume a = 0. Let S7 (x) = maxo<e<n Sk(x), and EF, ={x| St > 
0}, so E = UnEn. For x € En, we have f(x) > Si (x) — Si (T(z)), so 
Si, fa 2 0.] 
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#5. Prove the Pointwise Ergodic Theorem (14.3.2). 


[Hint: Either {x | limsup S,(x)/n > a} or its complement must have mea- 
sure 0. If it is the complement, then Exercise 4 implies [ pha ea] 


#6. Assume the setting of the Pointwise Ergodic Theorem (14.3.2). For 
every bounded y € £1(X, y), show, for a.e. x € X, that 


Jim ~ Y o(T(n)) = ay | ee 


[Hint: You may assume the Pointwise Ergodic Theorem. Use Lusin’s Theo- 
rem (B6.6) to approximate y by a continuous function.| 


#7. (harder) Remove the assumption that y is bounded in Exercise 6. 


#8. Suppose 
e {a‘} is a (continuous) 1-parameter group of homeomorphisms of a 
topological space X, and 
e yw is an ergodic, a‘-invariant, finite measure on X. 
For every bounded, continuous function f on X, show that 
1 


: i 
lim af flats ar=—— | fay for a.e. rE X. 
ToT Jo (ea) WX) Sx 


[Hint: Apply Theorem 14.3.2 to f(x) = Ls f(a‘x) dt if a’ is ergodic (cf. 
Exercise 14.2#17).] 


§14.4. Ergodic decomposition 


In this section, we briefly explain that every group action (with a quasi- 
invariant measure) can be decomposed into ergodic actions. 


(14.4.1) Example (Irrational rotation of the plane). For any irrational a € R, 
define a homeomorphism T,, of C by T,(z) = e27x. Then |T,(z)| = |z|, so 
each circle centered at the origin is invariant under T,. The restriction of 
T. to any such circle is an irrational rotation of the circle, so it is ergodic 
(see Example 14.2.2). Thus, the entire action can be decomposed as a union 
of ergodic actions. 


A similar decomposition is always possible, as long as we work with nice 
spaces: 


(14.4.2) Definition. A topological space X is Polish if it is homeomorphic 
to a complete, separable metric space. 


(14.4.3) Theorem (Ergodic decomposition). Suppose a Lie group H acts 
continuously on a Polish space X, and ps is a quasi-invariant, finite measure 
on X. Then there exist 


e a measurable function ¢: X — [0,1], and 


e a finite measure uz on C~'(z), for each z € [0,1], 
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such that = So yy He dv(z), where v = ¢,ps. For f € C.(X), this means 


[tam ff fecal 


Furthermore, for a.e. z € (0, 1], 

1) ¢-1(z) is H-invariant, and 

2) uz is quasi-invariant and ergodic for the action of H. 
(14.4.4) Remark. The ergodic decomposition is unique (a.e.). More precisely, 
if ¢’ and yw), also satisfy the conclusions, then there is a measurable bijection 
m: [0,1] — [0,1], such that 

1)  =70¢ ae., and 

2) Mas) = pl, for ae. z. 


(14.4.5) Definition. In the notation of Theorem 14.4.3, each set ¢~!(z) is 
called an ergodic component of the action. 


The remainder of this section sketches two different proofs of Theo- 
rem 14.4.3. 


§14.4(i). First proof. The main problem is to find the function ¢, be- 
cause the following general Fubini-like theorem will then provide the required 
decomposition of jz into an integral of measures 4, on the fibers of ¢. (In 
Probability Theory, each jz, is called a conditional measure of [.) 
(14.4.6) Proposition. Suppose 

e X and Z are Polish spaces, 

e ¢:X > Z is a Borel measurable function, and 

e 4 is a probability measure on X. 

Then there is a Borel map A: Z — Prob(X), such that 
1) w= f, Az dv(z), where v = Cp, and 
2) Ag (> * @) = 1; forall ze Z. 

Furthermore, X is unique (a.e.). 

The map ¢ is a bit difficult to pin down, since it is not completely well- 
defined — it can be changed on an arbitrary set of measure zero. We cir- 
cumvent this difficulty by looking not at the value of ¢ on individual points 
(which is not entirely well-defined), but at its effect on an algebra of functions 
(which is completely well defined). 

(14.4.7) Definitions. Assume yp is a finite measure on a Polish space X. 


1) Let B(X) be the collection of all Borel subsets of X, where two sets are 
identified if they differ by a set of measure 0. This is a o-algebra. 
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2) B(X) is a complete, separable metric space, with respect to the metric 
d(A, B) = u(AAB), where AAB = (A\ B)U(B\ A) is the symmetric 
difference of A and B. 

3) If a Lie group H acts continuously on X, we let B(X)” be the set of 
H-invariant elements of B(X). This is a sub-o-algebra of B(X). 

The map ¢ is constructed by the following result: 


(14.4.8) Lemma. Suppose 
e py is a finite measure on a Polish space X, and 
e B is a sub-o-algebra of B(X). 
Then there is a Borel map ¢: X — [0,1], such that 
B={¢7'(E) | E is a Borel subset of [0,1] }. 


Idea of proof. Let 
e {E,,} be a countable, dense subset of B, 
e X,, be the characteristic function of E,, for each n, and 


© (a) =>) BY) 


n=1 

It is clear from the definition of ¢ that if J is any open interval in [0, 1], then 
¢~1(1) is a Boolean combination of elements of {E,,}; therefore, it belongs 
to B. Since B is a o-algebra, this implies that ¢~1(£) € B for every Borel 
subset E of [0, 1]. 

Conversely, it is clear from the definition of ¢ that each E,, is the inverse 
image of a (one-point) Borel subset of [0,1]. Since {E,,} generates B as a 
o-algebra (see Exercise 2), this implies that every element of B is the inverse 
image of a Borel subset of [0, 1]. O 


We will use the following very useful fact: 


(14.4.9) Theorem (von Neumann Selection Theorem). Let 

e X andY be Polish spaces, 

e yw be a finite measure on X, 

e F bea Borel subset of X x Y, and 

e Xz be the projection of F to X. 
Then there is a Borel function ®: X —+ Y, such that (a; ®(x)) € Ff, for a.e. 
ceEXF. 


The first proof of Theorem 14.4.3. We wish to show that wp, is er- 
godic (a.e.). If not, then there is a set E of positive measure in [0,1], 
such that, for each z € EF, the action of H on (ere ae) is not ergodic. 
This means there exists an H-invariant, measurable, {0, 1}-valued function 
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yz € L(C~'(z), wz) that is not constant (a.e.). There are technical prob- 
lems that we will ignore, but, roughly speaking, the von Neumann Selection 
Theorem (14.4.9) implies that the selection of y, can be done measurably, so 
we have a Borel subset A of X, defined by 

A={xeEX |¢(“) € EF and y,(z) =1}. 

Since y, is not constant on the fiber ¢~1(z), we know that A is not of the 
form ¢~'(E), for any Borel subset E of [0,1]. On the other hand, we have 
A € B(X)" (since each vy, is H-invariant). This contradicts the choice of the 
function ¢. 


§14.4(ii). Second proof. We now describe a different approach. In- 
stead of obtaining the decomposition of yz from the map ¢, we reverse the 
argument, and obtain the map ¢ from a direct-integral decomposition of p. 
For simplicity, however, we assume that the space we are acting on is compact. 
We consider only invariant measures, instead of quasi-invariant measures, so 
we do not have to keep track of Radon-Nikodym derivatives. 


(14.4.10) Definitions. Suppose C is a convex subset of a vector space V. 

1) A point c € C is an extreme point of C if there do not exist co,c, € 

C \ {c} and t € (0,1), such that c = tcg + (1 — t)c1. 

2) Let ext C be the set of extreme points of C. 
(14.4.11) Example. Suppose H acts continuously on a compact, separable 
metric space X, and let 

Prob(X)” = { u € Prob(X) | y is H-invariant }. 

This is a closed, convex subset of Prob(X), so Prob(X)” is a compact, convex 
subset of C(X)* (with the weak* topology). The extreme points of this set 


are precisely the H-invariant probability measures that are ergodic (see Ex- 
ercise 3). 


The well-known Krein-Milman Theorem states that every compact, con- 
vex set C’ is the closure of the convex hull of the set of extreme points of C’. 
(So, in particular, if Cis nonempty, then there exists an extreme point.) We 
will use the following strengthening of this fact: 

(14.4.12) Theorem (Choquet’s Theorem). Suppose 

e Y is a locally convex topological vector space over R, 

e C is a metrizable, compact, convex subset of VY, and 

ecmEeC. 


Then there is a probability measure v on ext C, such that 


co= i xdv(x). 
ext C 


We will also need a corresponding uniqueness result. 
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(14.4.13) Definitions (Choquet). Suppose Y and C are as in the statement 
of Theorem 14.4.12. 


1) Let HC = {tce|t€[0,cc), cEC}Cy. 
2) Define a partial order < on HC by a < biffb-—ae XC. 


3) Two elements a1,a2 € UC have a least upper bound if there exists 
b € NC, such that 
e a; <6 fori = 1,2, and 
e for all ce UC, such that a; < c for i = 1,2, we have b < «. 


(14.4.14) Example. Any two elements of © Prob(X) have a least upper 
bound (see Exercise 4), so the same is true for Prob(X)”. 


(14.4.15) Theorem (Choquet). Suppose Y, C, and co are as in the statement 
of Theorem 14.4.12. If every two elements of iC have a least upper bound, 
then the measure v provided by Theorem 14.4.12 is unique. 


The second proof of Theorem 14.4.3. Assume, for simplicity, that py 
is H-invariant, and that X is compact. By normalizing, we may assume 
p(X) = 1, so uw € Prob(X). Then Choquet’s Theorem (14.4.12) provides a 
probability measure v, such that 


i / odv(c). 
ext Prob(X)# 


By identifying ext Prob(X) with a Borel subset of [0,1], we may rewrite this 


as: 
b= ( juz dv (2), 
[0,1] 


where v is a probability measure on |0, 1]. Furthermore, Exercise 3 tells us that 
each o € ext Prob(X)” is ergodic, so pz is an ergodic H-invariant measure 
for a.e. z. 

All that remains is to define a map ¢: X — [0,1], such that p, is 
concentrated on ¢~+(z). For each Borel subset E of ext Prob(X)", let 
pf, po7d(c). Then pg is absolutely continuous with respect to py, so 
we may write uz = fey, for some measurable fg: X — [0,00). Then 


P(E) ={c£ EX | fe(x) £0} 
is a well-defined element of 6(X). Therefore, we have defined a map wy: B ({0, 1)) = 
B(X), and it can be verified that this is a homomorphism of o-algebras. 
Hence, there is a measurable function ¢: X — [0,1], such that ~(F) = 
¢~'(E), for all E (see Exercise 5). By using the uniqueness of v, it can 
be shown that (¢~!(z)) =1 for ae. z. O 


Exercises for §14.4. 


#1. Let B bea (nonempty) subset of B(X) that is closed under complements 
and finite unions. Show that 6 is closed under countable unions (so B 
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is a sub-o-algebra of 6(X)) if and only if B is a closed set with respect 
to the topology determined by the metric on B(X). 
[Hint: (<=) If FE =U, &, then Uf_, Hi > E in the topology on B(X). 
(=) Ifd( Ei, FE) < 2™, then FE = (\"~_, UX, Ei (up to a set of measure 0.] 
#2. Show that if € is dense in a sub-o-algebra 6 of B(X), then € is not 
contained in any proper sub-o-algebra of B. 
[Hint: If E, > E, then U7, (En E) = E (up to a set of measure 0).] 
#3. Prove that a measure yp € Prob(X)” is ergodic iff it is an extreme 
point. 
[Hint: If E is an H-invariant set, then p is a convex combination of the 
restrictions to E and its complement. Conversely, if yo = tui + (1 — t) pe, 


then the Radon-Nikodym Theorem implies ui = f 4 for some (H-invariant) 
function f.] 


#4. Show that any two elements of © Prob(X) or © Prob(X)* have a least 
upper bound. 


(Hint: Write v = f +s (uniquely), where v; is concentrated on a set of 
measure 0.| 


#5. Suppose ~: B(Z) > B(X) is a function that respects complements 
and countable unions (and 7(@) = 0). Show there is a Borel function 
¢: X > Z, such that (EF) = ¢~1(£), for every Borel subset E of Z. 
[Hint: Assume, for simplicity, that Z = {> ax3~* | ax € {0,1}} C [0,1]. 


Then ¢ = )>Xpn,3-“ for an appropriate collection {E,} of Borel subsets 
of X.] 


§14.5. Mixing 


It is sometimes important to know that a product of group actions is ergodic. 
To discuss this issue (and related matters), let us fix some notation. 
(14.5.1) Notation. Throughout this section, we assume: 

1) X; is a Polish space, for every i, 

2) H is a Lie group that acts continuously on X;, for each 7, and 

3) pi; is an H-invariant probability measure on X;, for each 7. 


Furthermore, we use X and yw as abbreviations for Xq and po, respectively. 


(14.5.2) Definitions. 

1) The product action of H on X, x X2 is the H-action defined by 
h(a1,%2) = (ha,,hx2). The product measure jz, X 2 is an invariant 
measure for this action. 

2) The action on X is said to be weak mixing (or weakly mixing) if 
the product action on X x X is ergodic. 


We have the following very useful characterizations of weakly mixing ac- 
tions: 
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(14.5.3) Theorem. The action of H on X is weak mixing if and only if the 
(one-dimensional) space of constant functions is the only nontrivial, finite- 
dimensional, H-invariant subspace of L>X, 1). 


Proof. We prove the contrapositive of each direction. 

(=) Suppose V is a nontrivial, finite-dimensional, H-invariant subspace 
of £°X). If the functions in V are not all constant (a.e.), then we may 
assume (by passing to a subspace) that V L 1. Choose an orthonormal basis 
{~1,---, Yr} of V, and define y: X x X > C by 


y(a,y) = ae yi(x) ily). (14.5.4) 


Then y is an H-invariant function that is not constant (see Exercise 2), so H 
is not ergodic on X x X. 
(<) Suppose y is a nonconstant, H-invariant, bounded function on X x 


X. We may assume (x,y) = y(y,x) by replacing y with either y(z, y) + 


p(y, x) or V—1 (p(x, y) — yy, z)). Therefore, we have a compact, self-adjoint 
operator on £°(X, 11), defined by 


(Tp)(2) = I pay oaNanl): 


The Spectral Theorem (B7.14) implies T has an eigenspace V that is finite- 
dimensional (and contains a nonconstant function). This eigenspace is H- 
invariant, since T commutes with H (because y is H-invariant). O 


We often have the following stronger condition: 


(14.5.5) Definition. The action of H on X is said to be mixing (or, alter- 
natively, strongly mixing) if H is noncompact, and, for all y,w € LX, 1), 
such that y L 1, we have 


(hy | %) =0. 


(We are using 1 to denote the constant function of value 1, and, as usual, 
(hy)(x) = y(h~tx) (see Example 11.1.3).) 


lim 
hoo 


(14.5.6) Proposition (see Exercise 3). If H is not compact, then the following 
are equivalent: 


1) The action of H on X is mixing. 
2) For all y,w € LX, pw), we have 
jim (hy |) = (| 1) (1 |e). 
oo 
3) For all Borel subsets A and B of X, we have 
jim w(hAN B) = (A) w(B). 
00 
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(14.5.7) Remark. Condition (3) is the motivation for the choice of the term 
“mixing:” as h — oo, the space X is getting so stirred up (or well-mixed) 
that hA is becoming uniformly distributed throughout the entire space. 


When G is simple, decay of matrix coefficients (11.2.2) implies that every 
action of G (with finite invariant measure) is mixing. In fact, we can say 
more. 


(14.5.8) Definition. Generalizing Definition 14.5.5, we say that the action 
of H on X is mixing of order r if H is not compact, and, for all Borel 
subsets Aj,...,A, of X, we have 


a! a has) = p(A1) u(A2) «+> W(Ar). 
In particular, 
e every action of H is mixing of order 1 (if H is not compact), and 
e “mixing” is the same as “mixing of order 2” 


(14.5.9) Warning. Some authors use a different numbering, for which this is 
“mixing of order r — 1? instead of “mixing of order r? 


Ledrappier constructed an action of Z? that is mixing of order 2, but not 
of order 3. However, there are no such examples for semisimple groups: 


(14.5.10) Theorem. Every mizing action of G (with finite invariant measure) 
is mixing of all orders. 


In the special case where H = Z, we mention the following additional 
characterizations, some of which are weaker versions of Proposition 14.5.6(3): 


(14.5.11) Theorem. Jf H = (T) is an infinite cyclic group, then the following 
are equivalent: 


1) A is weak mixing on X. 


2) Every eigenfunction of T in £°(X,) is constant (a.e.). That is, if 
f © LYX, pu), and there is some X € C, such that f(Tx) =A f(x) ae., 
then f is constant (a.e.). 


3) The spectral measure of T has no atoms other than 1, and the eigen- 
value 1 is simple (that is, the corresponding eigenspace is 1-dimensional). 
4) The action of H on X x X, is ergodic, whenever the action of H on X, 
is ergodic. 
5) For all Borel subsets A and B of X, 
_ lg 
lim — S°|u(T*AN B) — (A) p(B)| = 0. 


noo nN 
k=1 
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6) There is a subset N of full density in Z* , such that, for all Borel subsets 
A and B of X, we have 


u(T®AN B) — p(A) p(B) ask — co withk EN. 
To say N has full density means 


2 FEIN 415253, 2: 590) 
lim 
n—- oo n 
7) If Ao, A1,--., A,r are any Borel subsets of X, then there is a subset N 
of full density in Z*, such that 


jim 1(Ao al Pr As a Peta. MA TA, ) = j(Ao) (Ay) aiiey p(A,). 
—0o 
keN 


Sketch of proof. (1 <= 2) By Theorem 14.5.3, it suffices to observe that 
every finite-dimensional representation contains an irreducible subrepresenta- 
tion, and that the irreducible representations of Z (or, more generally, of any 
abelian group) are one-dimensional. 

(2 = 3) These are two different ways of saying the same thing. 

(3 = 4) Since £°X x X1) = £7 X) @ L7X)), the spectral measure v 
of £°(X x X,) is the product 1, x v2 of the spectral measures of £°.X) and 
£°X,). Therefore, any point mass in v is obtained by pairing a point mass 
in Vy; with a point mass in 1. 

(4 => 1) Take X, = X. 

(1 = 5) For simplicity, let a = (A) and b = p(B). By Exercise 14.343 
(the Mean Ergodic Theorem), ergodicity on X implies 


=; 


i ne 
— > w(T*ANB) <2 ab. 


k=1 
For the same reason, ergodicity on X x X implies 
So(u x uw) (TRA x TEA) 9 (Bx B)) "=F (wx p)(A x A) + (Hx H)(B x B). 
k=1 


By simplifying both sides, we see that 


lg n¢0 
~S 0 (TRAN BY? "> a?b?. 
a= 

Therefore, simple algebra yields 


YS |u(T* AN B) - pA) y(B)|” "23° a? b? — 2(ab)(ab) + (ab)? = 0. 


Exercise 4 implies that we have the same limit without squaring the absolute 
value. 

(5 = 2) Approximating by linear combinations of characteristic functions 
implies 


Jin Get *» | »)| = 0 for all y 1 1. 
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However, if y is an eigenfunction for an eigenvalue # 1, then it is easy to see 
that the limit is nonzero (either directly, or by applying Exercise 4). 

(5 = 6) Exercise 4 implies that the two assertions are equivalent, up to 
reversing the order of the quantifiers in (6). To reverse the quantifiers, note 
that a variant of Cantor diagonalization provides a set N of full density that 
works for all A and B in a countable dense subset of B(X). 

(7 => 6) Take r = 1. 

(6 = 7) The proof proceeds by induction on r (with (6) as the starting 
point), and is nontrivial. We have no need for this result, so we omit the 
proof. 


Exercises for §14.5. 


#1. Show (directly from the definitions) that if the action of H on X is 
weak mixing, then it is ergodic. 


#2. Let yp: X x X > C be as in (14.5.4) of the proof of Theorem 14.5.3. 
a) Show y is H-invariant (a.e.). 
b) Show ¢ is not constant (a.e.). 
[Hint: (a) Write hy: = )7,,hij~;, and observe that [hi,j] is a unitary 
matrix. 
(b) v(x, x) > 0, but f (x,y) duly) = 0.] 
#3. Prove Proposition 14.5.6. 


[Hint: (1 = 2) For c = (y | 1), we have ((y — c) | 1) = 0. Now calculate 
limp—+oo(h(y — c) | w) in two ways. 

(2 => 3) Let y and w be the characteristic functions of A and B. 

(3 = 2) Approximate vy and w by linear combinations of characteristic func- 
tions. ] 


#4. For every bounded sequence {a,} C [0,00), show 
n 
lim — SS ap =0 & ay >0as k > w in some set of full density. 
noo N 
k=1 
[Hint: (=) For each m > 0, the set A» = {k | ax > 1/m} has density 0, so 
there exists Nm > Nm-—1i, such that, for all n > Nm, we have 
Nm—1+ #(Am pg ore ,n}) < n/m. 

Let NV be the complement of U,,, (Am [Nm, Nm+1))-] 


Notes 


The focus of classical Ergodic Theory is on actions of Z and R (or other 
abelian groups). A few of the many introductory books on this subject are 
(1, 4, 5, 12]. They include proofs of the Poincaré Recurrence Theorem (4.6.1) 
and the Pointwise Ergodic Theorem (14.3.2). 

Some basic results on the Ergodic Theory of noncommutative groups can 
be found in [13, §2.1]. 
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The Moore Ergodicity Theorem (14.2.4) is due to C.C. Moore [8}. 

See [7] for a very nice version of the Pointwise Ergodic Theorem that 
applies to all amenable groups (Remark 14.3.3). 

See [3, Thm. 1.1 (and Thm. 5.2)] for a proof of the ergodic decompo- 
sition (14.4.3), using Choquet’s Theorem as in Subsection 14.4(ii). Proposi- 
tion 14.4.6 can be found in [10, §3]. See [11, §5.5] for a proof of Theorem 14.4.9. 

See [8, Prop. 1, pp. 157-158] for a proof of Proposition 14.5.6. 

The standard texts on ergodic theory only prove Proposition 14.5.6 for 
the special case H = Z, but the same arguments apply in general. 

Theorem 14.5.10 is due to S. Mozes [9]. Ledrappier’s counterexample for 
Z? is in [6]. 

Theorem 14.5.11 is in the standard texts on ergodic theory, except for 
Part (7), which is a “multiple recurrence theorem” that plays a key role in 
Furstenberg’s proof of Szemeredi’s theorem that there are arbitrarily long 
arithmetic progressions in every set of positive density in Zt. For a proof of 
(6 = 7), see [1, Prop. 7.13, p. 191] or [2, Thm. 4.10). 

A proof of Exercise 14.5#4 is in [1, Lem. 2.41, p. 54]. 
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Part IV 


Major Results 


Chapter 15 


Mostow Rigidity Theorem 


§15.1. Statement of the theorem 


In its simplest form, the Mostow Rigidity Theorem says that a single group [° 
cannot be a lattice in two different semisimple groups G, and G» (except for 
minor modifications involving compact factors, the center, and passing to a 
finite-index subgroup): 


(15.1.1) Theorem (Weak version of the Mostow Rigidity Theorem). Assume 


e G, and Gz are connected, with trivial center and no compact factors, 
and 
e I; is a lattice in G;, fori = 1,2. 
If 1, =o, then Gy = Go. 


In other words, if there is an isomorphism from I’; to 2, then there is also 
an isomorphism from G, to G2. In fact, it is usually the case that something 
much stronger is true: 

(15.1.2) Mostow Rigidity Theorem. Assume 


e G, and G2 are connected, with trivial center and no compact factors, 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: Quasi-isometries (Chapter 10). 
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e I; is a lattice in G;, fori = 1,2, and 


e there does not exist a simple factor N of Gi, such that N ~ PSL(2,R) 


and NOT, is a lattice in N. 


Then any isomorphism from 1, to Tg extends to a continuous isomorphism 
from Gy to Go. 


(15.1.3) Remarks. 


1) 


Assume G is connected, and has no simple factors that are either com- 
pact or isogenous to SL(2,R). Then the Mostow Rigidity Theorem 
implies that lattices in G have no nontrivial deformations. More pre- 
cisely, if [; is a continuous family of lattices in G, then IT; is conjugate 
to Io, for every t (see Exercise 7). 

This is not always true when G is isogenous to SL(2,R) (see Sec- 
tion 15.3), which explains why the statement of the Mostow Rigidity 
Theorem must forbid factors that are isogenous to SL(2, R). 


In geometric terms, the Mostow Rigidity Theorem tells us that the 
topological structure of any irreducible finite-volume locally symmetric 
space of noncompact type completely determines its geometric structure 
as a Riemannian manifold (up to multiplying the metric by a scalar on 
each irreducible factor of the universal cover), if the manifold is not 
2-dimensional (cf. Exercise 8). 


Assume IT’ is cocompact in G. Then, as a strengthening of Theo- 
rem 15.1.1, it can be shown that if I is a cocompact lattice in some 
Lie group H (not assumed to be semisimple), then H must be either G 
or [ (modulo the usual minor modifications involving compact groups). 
In fact, this remains true even if we allow H to be any locally compact 
group, not necessarily a Lie group. 


Exercises for §15.1. 


HA. 


#2. 


#3. 


Suppose 

e G has trivial center and no compact factors, 

e GS PSL(2,R), and 

e IT is irreducible. 
Show that every automorphism of [ extends to a continuous automor- 
phism of G. 


Show that Theorem 15.1.1 is a corollary of Theorem 15.1.2. 


[Hint: This is obvious when no simple factor of G; is isomorphic to PSL(2, R). 
The problem can be reduced to the case where G; and G2 are irreducible.] 


Assume G'; and G2 are isogenous. Show that if K is any compact group, 
then some lattice in G’; is isomorphic to a lattice in G2 x K (even though 
G, may not be isomorphic to Gz x K). This is why Theorem 15.1.1 
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assumes G, and G2 are connected, with trivial center and no compact 
factors 
[Hint: Any torsion-free lattice in G? is isomorphic to a lattice in G2 x K.] 


#4. For 1 = 1,2, suppose 
e I’; is a lattice in G;, and 
e G; is connected and has no compact factors. 
Show that if I; is isomorphic to Iz, then Gy is isogenous to Go. 


#5. Let [; be a lattice in G; for 7 = 1,2. Show that if [; = To, then 
there is a compact, normal subgroup K; of G?, for 7 = 1,2, such that 
Go/Ky = G5/Ko. 


#6. Let G = PSL(2,R). Find an automorphism y of some lattice [ in G, 
such that y does not extend to an automorphism of G. 
Why is this not a counterexample to Theorem 15.1.2? 


#7. Assume there is a continuous function p: [ x [0,1] - G, such that, if 
we let p:(y) = p(y, t), then: 
e p; is a homomorphism, for all f, 
e p:(T) is a lattice in G, for all t, and 
e poly) =7, for all y ET. 
Show that if G is as in the first sentence of Remark 15.1.3(1), then I, 
is conjugate to I, for every t. 


[Hint: Reduce to the case where G has trivial center. You may use, without 
proof, the fact that the identity component of the automorphism group of G 
consists of inner automorphisms (see Remark A6.4).| 


#8. (Requires some familiarity with locally symmetric spaces) Assume, for 
7=1,2: 
G; is connected and simple, with trivial center, 
K; is a maximal compact subgroup of G;, 
I’; is a torsion-free, irreducible lattice in G;, 
X; = K;\G;/T; is the corresponding locally symmetric space of 
finite volume, 
e the metric on X; is normalized so that vol(X1) = 1, and 
e dim xX 1 = 3. 
Show that any homotopy equivalence from X, to X2 is homotopic to 
an isometry. 


[Hint: Since the universal cover K;\G; is contractible, a homotopy equiva- 
lence is determined, up to homotopy, by its effect on the fundamental group.| 


§15.2. Sketch of the proof for SO(1, 7) (optional) 


In most cases, the conclusion of the Mostow Rigidity Theorem (15.1.2) is an 
easy consequence of the Margulis Superrigidity Theorem, which will be dis- 
cussed in Chapter 16. More precisely, if we assume, for simplicity, that the 


312 15. MOSTOW RIGIDITY THEOREM 


lattices [, and [2 are irreducible (see Exercise 1), then the Margulis Super- 
rigidity Theorem applies unless G; and G2 are isogenous to either SO(1, 7) 
or SU(1,n) (see Exercise 16.2#1). To illustrate the main ideas involved in 
completing the proof, we discuss a special case: 


Proof of Mostow Rigidity Theorem for cocompact lattices in SO(1, 7). 
Assume, for i = 1, 2: 

e G; = PSO(1,7n;), for some n; > 3, 

e I’; is cocompact in G;, and 

e p: 1, > [2 is an isomorphism. 
In order to show that p extends to a continuous homomorphism from G to Go, 
we take a geometric approach that uses the action of G; on its associated 
symmetric space, which is the hyperbolic space §)"'.. This assumes some 


understanding of hyperbolic space (and other matters) that is not required 
elsewhere in this book. 


Claim. We have ny = nz. By passing to subgroups of finite index, we 
may assume I’; and Ip are torsion free, so I; acts freely on 9”. The action 
is also properly discontinuous, so, since hyperbolic space is contractible, this 
implies that X; =T;\" is a K(T;,1)-space. Since X; is a compact manifold 
of dimension n;, we conclude that the cohomological dimension of T; is nj. 
However, the groups I; and [2 are isomorphic, so they must have the same 
cohomological dimension. This completes the proof of the claim. LJ 


Therefore, [, and [ are two lattices in the same group G = PSO(1,n). 
To simplify matters, let us assume n = 3. 
Since T'; is cocompact in G; (so T; is quasi-isometric to $9), it is not 
difficult to construct a quasi-isometry y: 9° — 9%, such that 
y(yz) = ply): v(x) for ally €T, and x € §° (15.2.1) 


(see Exercise 2). 

Consider the ball model of §*, whose boundary 0)? is the round 2- 
sphere S?. It is easy to see that any isometry ¢ of §)° induces a well-defined 
homeomorphism ¢ of 05°. Furthermore, it is well known that this boundary 
map is conformal (i.e., it is angle-preserving). This implies that if C is 
a very small circle in 09°, then ¢(C) is very close to being a circle; more 
precisely, if we let S,.(p) be the sphere of radius r around the point p, then 

suPz<s,(p) (O(a), O(p)) 


lim sup — = = 
r>0+ infyes,(p) d(o(y), (p)) 
With this background in mind, it should not be difficult to believe (and it is 
not terribly difficult to prove) that the quasi-isometry y induces a well-defined 
homeomorphism @ of 059°, such that 


(yp) = p(y) - G(p) for all y ET; and p € AH’. (15.2.2) 
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Furthermore, this boundary map is quasi-conformal, which means that if 
C is a very small circle in 09%, then G(C) is approximated by an ellipse of 
bounded eccentricity; more precisely, there is some constant K > 0, such that, 
for all p € 08°, we have 


ij SUPz€5S,.(p) d(P(x), P(p)) 
im sup - 


r>0t infyes,.(p) d(P(y), P(p)) 

It is a fundamental, but highly nontrivial, fact that quasi-conformal maps 
are differentiable almost everywhere. Therefore, if C’, is a circle (centered at 
the origin) in the tangent plane at almost any point p € 09%, then G(C,) 
is an ellipse in the tangent plane at G(p). Furthermore, since multiplying all 
the vectors in a tangent plane by a scalar does not change the eccentricity 
of any ellipse in the plane, we see that the eccentricity e, of this ellipse is 
independent of the choice of the circle C,. So e, is a well-defined, measurable 
function on (almost all of) 09°. 


Case 1. Assume ey = 1 for almost all p € 0S)°. This implies that the quasi- 
conformal map @ is actually conformal. So there is an isometry a of §°, such 
that @ = ¢. Then, for y €T, and p € 0Sj°, we have 


a(yp) = B(yP) = p(y) - P(p) = p(y) - B(p). 
On the other hand, since G is the identity component of Isom(Sj?), it is 
normalized by a, so there is an automorphism @ of G, such that, for all g € G 
and p € O$)°, we have 
a(gp) = a(g) - a(p). 

By comparing the displayed equations (and letting g = y), we see that 
a(y) = p(y) for all y € Ty. Therefore, @ is the desired extension of p to 
an isomorphism defined on all of G. 


Case 2. Assume ey is not almost always equal to 1. Since p(y) is conformal 
(because p(y) is an isometry), we see from (15.2.2) that ey, = ep for all 
y €T, and (almost) all p € 0%. However, since G is transitive on 05)? with 
noncompact point-stabilizers, the Moore Ergodicity Theorem (14.2.5) implies 
that Ty is ergodic on 05°, so we conclude that the function €p is constant 
(a.e). 

Thus, the assumption of this case implies that, for almost every p, the 
ellipse G(C,) is not a circle, and therefore has a well-defined major axis ¢,, 
which is a line through the origin in the tangent plane at @(p). Hence, {,} is 
a (measurable) section of a certain bundle PSj?, namely, the RP'-bundle over 
$)? whose fiber at each point is the projectivization of the tangent space. Fur- 
thermore, since 'y = p(I';) is conformal, we see from (15.2.2) that this section 
is ['j-invariant. In fact, if we rotate {¢,} by any angle 6, then the resulting 
section {¢?} is also invariant (since P2 is conformal). Then Uo<o<n pollo} 


is a measurable, ['2-invariant subset of P?. However, the Moore Ergodic- 
ity Theorem (14.2.5) implies there are no such (nontrivial) subsets, since G 
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is transitive on PS? with noncompact point-stabilizers. This contradiction 
completes the proof, by showing that this case does not occur. L] 


(15.2.3) Remarks. 


1) 


The above proof makes the simplifying assumption that n = 3. Essen- 
tially the same proof works for larger values of n, but @(C,) will be an 
ellipsoid, rather than an ellipse, so the space @, of major directions may 
be a higher-dimensional subspace of the tangent space, instead of just 
being a line. 

Mostow was able to modify the proof to deal with SU(1,n), instead 
of SO(1,n), by developing a theory of maps that are quasiconformal 
over C. (In fact, replacing C with the quaternions and octonions yields 
proofs for lattices in the other simple groups of real rank one, namely, 
Sp(1,n) and Fy. However, this is not necessary, because the Margulis 
Superrigidity Theorem applies to these groups.) 


For lattices in SO(1, 7) that are not cocompact, it is not at all obvious 
that an isomorphism I; & Ig should yield a quasi-isometry H” > 9”. 
This was proved by G. Prasad, by using the “Siegel set” description of a 
coarse fundamental domain for the action of [; on 9” (cf. Chapter 19). 
The same method also works for noncocompact lattices in SU(1, 7), or, 
more generally, whenever rankg Ty = 1. 


Exercises for §15.2. 


#1. 


#2. 


Show that the proof of Theorem 15.1.2 can be reduced to the special 
case where the lattices [, and [2 are irreducible in G; and Go, re- 
spectively. In other words, assume the conclusion of Theorem 15.1.2 
holds whenever T°; is irreducible in G;, for i = 1,2, and show that this 
additional hypothesis can be eliminated. 


Construct a quasi-isometry y: 9? + ? that satisfies (15.2.1). 
[Hint: Let F be a precompact strict fundamental domain for the action of T; 


on §)°, and choose some ao € $)?. For  € 9°, let y(x) = p(y)ao, where 
rey-F.] 


§15.3. Moduli space of lattices in SL(2, R) 


Suppose I is a torsion-free, cocompact lattice in SL(2,R). In contrast to the 
Mostow Rigidity Theorem (15.1.2), we will see that an isomorphism [ = I’ 
need not extend to an automorphism of SL(2,R). In fact, there are uncount- 
ably many different embeddings of [ in SL(2,R) that are not conjugate to 
each other. 

To see this, we take a geometric approach. Since SL(2, R) acts transitively 
on the hyperbolic plane §? (by isometries), the quotient I'\S? is a compact 
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surface M. We will show there are uncountably many different possibilities 
for M (up to isometry). The proof is based on the fact that the hyperbolic 
plane has uncountably many different right-angled hexagons. 


(15.3.1) Lemma. The hyperbolic plane 9? has uncountably many different 
right-angled hexagons (no two congruent to each other). 


Proof. Fix a basepoint p € §”, and a starting direction U. For $s € (Rt), 
construct the piecewise-linear path L = L($°) in 9? determined by: 


e The path starts at the point p. P e4 
3 
e The path consists of 6 geodesic 
segments (or “edges”) e1,...,¢€6, of 


lengths s;,...,56, respectively. 


(a) es 


e The first edge e; starts at p and 


heads in the direction @ . C1 


e For i > 2, edge e; makes a Pp 
(clockwise) right angle with e;_1. 


The variables s1,..., 56 provide 6 degrees of freedom in the construction of L. 
Requiring that L be a closed path (i.e., that the terminal endpoint of L is 
equal to p) takes away two degrees of freedom (because $)? is 2-dimensional). 
Then, requiring the angle between eg and e, to be right angle takes away one 
more degree of freedom. Hence (from the Implicit Function Theorem), we 
see that there are 3 degrees of freedom in the construction of a right-angled 
hexagon in 9”. 


(15.3.2) Remark. In fact, calculations using the trigonometry of triangles 
in §)” yields the much more precise fact that, for any s2, 54, 86 € R*, there ex- 
ists a unique right-angled hexagon, with edges e1,...,e6, such that the length 
of edge e2; is exactly s2;, for i = 1,2,3. (That is, the lengths of the three 
edges €2, e4, and eg can be chosen completely arbitrarily, and they uniquely 
determine the lengths of the other three edges in the right-angled hexagon.) 


(15.3.3) Definition. A hyperbolic surface is a compact Riemannian man- 
ifold (without boundary) whose universal cover is the hyperbolic plane $7. 


(15.3.4) Corollary. There are uncountably many non-isometric hyperbolic 
surfaces of any given genus g > 2. 


Proof. Choose a right-angled hexagon P in §)”. Call its edges e1,...,é6, and 
let s; be the length of e;. Make a copy P’ of P, and form a surface P by 
gluing eg; to the corresponding edge e4, of P’, for i = 1,2,3. (Topologically, 
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this surface P is a disk with two holes, and is usually called a “pair of pants” ) 


Since P is right-angled, the three boundary curves of P are geodesics. Their 
lengths are 2s,, 253, and 2s5. 


Construct a closed surface M of genus 2 
from two copies of P, by gluing corre- 
sponding boundary components to each 
other. (For a discussion of higher genus, 
see Remark 15.3.5.) Since the only curves 
that have been glued together are geodesics, 
it is easy to see that each point in M has 
a neighborhood that is isometric to an 
open subset of 9. Therefore, the uni- 
versal cover of M is $)? (since M is com- 
plete). So M is a hyperbolic surface. 

Furthermore, from the construction, we see that M has a closed geodesic 
of length 2s,. (In fact, there is a geodesic of length 2s;, for 1 <i <6.) Since 
a single closed surface has closed geodesics of only countably many different 
lengths, but Lemma 15.3.1 implies that there are uncountably many possible 
values of s,, this implies there must be uncountably many different isometry 
classes of surfaces. O 


A surface of genus 2 can be 
made from two pairs of pants. 


(15.3.5) Remark. A hyperbolic surface M of any genus g > 2 can be con- 
structed by gluing together 2g — 2 pairs of pants: 


The lengths of the three boundary curves of each pair of pants can be var- 
ied independently (cf. Remark 15.3.2), except that curves that will be glued 
together need to have the same length. The surface can also be modified by 
rotating any boundary curve through an arbitrary angle @ before it is glued 
to its mate. This yields 6g — 6 degrees of freedom in the construction of M. 
It can be shown that this is precisely the dimension of the space of hyperbolic 
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surfaces of genus g. In other words, 6g — 6 is the dimension of the moduli 
space of hyperbolic surfaces of genus g. 


(15.3.6) Corollary. If T is any lattice in SL(2,R), then there are uncountably 
many nonconjugate embeddings of T as a lattice in SL(2,R). 


Proof. Let us assume SL(2,R)/IT is compact. (Otherwise, [ is a free group, 
so it is easy to find embeddings.) Let us also assume, for simplicity, that 
I is torsion free, so it is the fundamental group of the hyperbolic surface 
['\S?, which has some genus g. Then I is isomorphic to the fundamental 
group I” of any hyperbolic surface I’\)? of genus g. However, if \S)? is not 
isometric to I’\§?, then I cannot be conjugate to I’ in the isometry group 
of 97. Therefore, Corollary 15.3.4 implies that there must be uncountably 
many different conjugacy classes of subgroups I” that are isomorphic to I. 


§15.4. Quasi-isometric rigidity 
The Mostow Rigidity Theorem’s weak form (15.1.1) tells us (under mild hy- 


potheses) that lattices in two different semisimple Lie groups cannot be iso- 
morphic. In fact, they cannot even be quasi-isometric (see Definition 10.1.3): 


(15.4.1) Theorem. Assume 


e G, and G2 are connected, with trivial center and no compact factors, 
and 


e I; ts an irreducible lattice in G;, for i = 1,2. 
If 1) ST, then Gy = Go. 

Although nothing more than Theorem 15.4.1 can be said about quasi- 
isometric lattices that are cocompact (see Corollary 10.1.9), there is a much 
stronger conclusion for noncocompact lattices. Namely, not only are the Lie 
groups G, and G2 isomorphic, but the isomorphism can be chosen to make 
the lattices commensurable (unless G, = G2 = PSL(2, R)): 

(15.4.2) Theorem. Assume 


e G, and Gp» are connected, with trivial center and no compact factors, 
and 


e I; ts an irreducible lattice in G;, for i = 1,2. 
Then T, © To if and only if G, & G2 and either 
1) both G1/T, and G2/T2 are compact, or 
2) there is an isomorphism o: Gy > Go, such that (11) is commensurable 
to I's, or 
3) G, and G2 are isomorphic to PSL(2,R), and neither G,/T, nor G2/T2 
is compact. 
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One of the key ingredients in the proof of this theorem is the fact that 
any group quasi-isometric to a lattice is isomorphic to a lattice, modulo finite 
groups: 


(15.4.3) Theorem. If A is a finitely generated group that is quasi-isometric 
to an irreducible lattice T in G, then there are 


e a finite-index subgroup A’ of A, and 
e a finite, normal subgroup N of A’, 


such that A'/N is isomorphic to a lattice in G. 


Sketch of proof of a special case. Assume G = SO(1,3) and [I is co- 
compact. The group A acts on itself by translation. Since A + T ~ §3, this 
provides an action of A by quasi-isometries on §)°. Let A be the correspond- 
ing group of quasiconformal maps on 0§)°. Note that the quasiconformality 
constant « is uniformly bounded on A. 

Let (08?)3 be the space of ordered triples of distinct points in 09°, and 
define p: (09?)3 > 9° by letting p(a,b,c) be the point on the geodesic ab 
that is closest to c. It is not difficult to see that p is compact-to-one. Since the 
action of A is cocompact on §)°, this implies that the action of A is cocompact 
on (09?)3. 

The above information allows us to apply a theorem of Tukia to conclude 
that A is quasiconformally conjugate to a subgroup of SO(1,n). Hence, after 
conjugating the action of A by a quasi-isometry, we may assume A C SO(1, 7). 
Furthermore, if we fix a basepoint x9, then the map A +> Aq is a quasi- 
isometry from A to §°. This implies that A is a cocompact lattice in SO(1, 7). 

0 


Many additional ideas are needed to prove Theorem 15.4.2, but this suf- 
fices for the weaker version: 


Proof of Theorem 15.4.1. Theorem 15.4.3 tells us that [2 is isomorphic to 
a lattice in G, (if we ignore some finite groups). So Ty is a lattice in both G; 
and Gp. Therefore, the Mostow Rigidity Theorem (15.1.1) implies G; © Go, 
as desired. O 


From the Mostow Rigidity Theorem, we know that every automorphism 
of TP extends to an automorphism of G (if G is not isogenous to SL(2, R) and we 
ignore compact factors and the center). The following analogue of this result 
for quasi-isometries is another key ingredient in the proof of Theorem 15.4.2. 


(15.4.4) Theorem. Assume 
e T ts irreducible, and not cocompact, 


e G has trivial center and no compact factors, 


e G is not isogenous to SL(2,R), and 
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e f:T oP. 


Then f is a quasi-isometry if and only if it is at bounded distance from some 
automorphism of G. 


Exercises for §15.4. 


#1. Assume the hypotheses of Theorem 15.1.2, and also assume G/T, is 
not compact. Show that the conclusion of Theorem 15.1.2 can be ob- 
tained by combining Theorem 15.4.2 with Theorem 15.4.4. 


Notes 


The Mostow Rigidity Theorem (15.1.2) is a combination of (overlapping) 
special cases proved by three authors: 


e Mostow: IT is cocompact (case where G = SO(1,n) [4], general case 
[5]), 

e Prasad: rankg IT = 1 (which includes the case where I is not cocompact 
and rankrg G = 1) [6], 


e Margulis: rankg G > 2 (and [ is irreducible) (see Subsection 16.2(i)). 


In recognition of this, many authors call Theorem 15.1.2 the Mostow-Prasad 
Rigidity Theorem, or the Mostow-Prasad-Margulis Rigidity Theorem. 

The results described in Remark 15.1.3(3) are due to A. Furman [3]. 

See the exposition in [7, §5.9, pp. 106-112] for more details of the proof in 
Section 15.2. A different proof of this special case was found by Gromov, and 
is described in [7, §6.3, pp. 129-130]. Yet another nice proof (which applies 
to cocompact lattices in any groups of real rank one) appears in [1, §5.2]. 

Regarding Remark 15.2.3(2), see [5, §21, esp. (21,18)] for a discussion of 
the notion of maps that are quasiconformal over C (or H, or O). 

Regarding Remark 15.2.3(3), see [6] for Prasad’s proof of Mostow rigidity 
for lattices of Q-rank one. 

Regarding Section 15.3, see [7, Thm. 5.3.5] for a more complete discussion 
of “pairs of pants” and the dimension of the space of hyperbolic metrics on a 
surface of genus g. (The calculations to justify Remark 15.3.2 can be found 
in (7, §2.6].) 

The results on quasi-isometric rigidity in Section 15.4 include work of 
A. Eskin, B. Farb, B. Kleiner, B. Leeb, P. Pansu, R. Schwarz, and others. See 
the survey [2] for references, discussion of the proofs, and other information. 
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Chapter 16 


Margulis Superrigidity 
Theorem 


Roughly speaking, the Margulis Superrigidity Theorem tells us that homo- 
morphisms defined on TI can be extended to be defined on all of G (unless G 
is either SO(1,n) or SU(1,n)). In cases where it applies, this fundamental 
theorem is much stronger than the Mostow Rigidity Theorem (15.1.2). It also 
implies the Margulis Arithmeticity Theorem (5.2.1 or 16.3.1). 


§16.1. Statement of the theorem 


It is not difficult to see that every group homomorphism from Z* to R” can 
be extended to a continuous homomorphism from R* to R” (see Exercise 1). 
Noting that Z” is a lattice in R*, it is natural to hope that, analogously, 
homomorphisms defined on I’ can be extended to be defined on all of G. The 
Margulis Superrigidity Theorem shows this is true if G has no simple factors 
isomorphic to SO(1,m) or SU(1,m), except that the conclusion may only be 
true modulo finite groups and up to a bounded error. Here is an illustrative 
special case that is easy to state, because the bounded error does not arise. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, [is a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Mai ist , except that the proof of the main 
theorem requires real ran apter 8), amenability (Furstenberg’s Lemma 
(12.6.1)), and the Moore Ergodicity Theorem (Section 11.2). 
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(16.1.1) Theorem (Margulis). Assume 
e G=SL(k,R), with k > 3, 


e G/T is not compact, and 


e vy: T > GL(n,R) is any homomorphism. 


Then there exist: 
e a continuous homomorphism @: G — GL(n,R), and 
e a finite-index subgroup I’ of T, 

such that @(y) = y(y) for ally ET’. 


Proof. See Exercise 16.4#2. LO 


Here is a much more general version of the theorem that has a slightly 
weaker conclusion. To simplify the statement, we preface it with a definition. 


(16.1.2) Definition. G is algebraically simply connected if (for every ¢) 
every Lie algebra homomorphism g — Sl(¢,IR) is the derivative of a well- 
defined Lie group homomorphism G —> SL(£, R). 


(16.1.3) Remark. Every simply connected Lie group is algebraically simply 
connected, but the converse is not true (see Exercise 2). In general, if G 
is connected, then some finite cover of G is algebraically simply connected. 
Therefore, assuming that G is algebraically simply connected is just a minor 
technical assumption that avoids the need to pass to a finite cover. 


(16.1.4) Theorem (Margulis Superrigidity Theorem). Assume 


i) G is connected, and algebraically simply connected, 


ii) G is not isogenous to any group that is of the form SO(1,m) x K or 
SU(1,m) x K, where K is compact, 


iii) [ is irreducible, and 


iv) gy: T > GL(n,R) is a homomorphism. 


Then there exist: 
1) a continuous homomorphism @: G > GL(n,R), 
2) a compact subgroup C of GL(n,R) that centralizes G(G), and 
3) a finite-index subgroup I” of IT, 

such that p(y) € G(y) C, for ally ET". 


Proof. See Section 16.5. O 


(16.1.5) Remarks. 


1) Since y(y) € G(y) C, we have G(y)~! y(y) € C for all y. Therefore, 
although @(y) might not be exactly equal to y(y), the error is an ele- 
ment of C, which is a bounded set (because C' is compact). Hence, the 
size of the error is uniformly bounded on all of I’. 
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2) Assumption (ii) cannot be removed. For example, if G = PSL(2,R), 
then the lattice [ can be a free group (see Remark 6.1.6). In this case, 
there exist many, many homomorphisms from I into any group G’, and 
many of them will not extend to G (see Exercise 5). 


If we make an appropriate assumption on the range of y, then there is no 
need for the compact error term C or the finite-index subgroup I’: 
(16.1.6) Corollary. Assume 


i) G is not isogenous to any group that is of the form SO(1,m) x K or 
SU(1,m) x K, where K is compact, 


ii) T is irreducible, and 
iii) G and G’ are connected, with trivial center, and no compact factors. 


If pe: T > G' is any homomorphism, such that p(T) is Zariski dense in G’, 
then extends to a continuous homomorphism ~: G > G’. 


Proof. See Exercise 6. 


Because of our standing assumption (4.0.0) that G’ is semisimple, Corol- 


lary 16.1.6 implicitly assumes that the Zariski closure y(I) = G’ is semisimple. 
In fact, that is automatically the case: 
(16.1.7) Corollary. Assume 


i) G is not isogenous to any group that is of the form SO(1,m) x Kk or 
SU(1,m) x K, where K is compact, 


ii) Tis irreducible, and 


iii) gy: T > GL(n, R) is a homomorphism. 


Then y(T) is semisimple. 


Proof. See Exercise 9. 


Exercises for §16.1. 


#1. Suppose y is a homomorphism from Z* to R”. Show that y extends to 
a continuous homomorphism from R* to R”. 


[Hint: Let @: R* > R” be a linear transformation, such that @(e;) = y(e:), 
where {€1,...,¢%} is the standard basis of R*.] 

#2. Show that SL(n,R) is algebraically simply connected. (On the other 
hand, SL(n, R) is not simply connected, because its fundamental group 
is nontrivial.) 


[Hint: By tensoring with C, any homomorphism $l(n,R) — sI(@,R) extends 
to a homomorphism sl(n,C) > sl(¢,C), and SL(n, C) is simply connected.] 


#3. Assume 
e G is not isogenous to SO(1,7) or SU(1,n), for any n, 
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#16. 
#7. 


#8. 
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e IT is irreducible, and 

e G has no compact factors. 
Use the Margulis Superrigidity Theorem to show that the abelianization 
T/|C,T] of T is finite. (When G is simple, this was already proved from 
Kazhdan’s property (7) in Corollary 13.4.3(2). We will see yet another 
proof in Exercise 17.1#1.) 


Assume G, IT, y, @, C, and I” are as in Theorem 16.1.4. Show there is a 
homomorphism e: I’ + C, such that y(y) = ¢(7) - €(y), for all y € I”. 


Suppose G = PSL(2,R) and I is a free group. Construct a homomor- 
phism y: T > GL(n,R) (for some n), such that, for every continuous 
homomorphism ¢: G + GL(n,R), and every finite-index subgroup I’ 
of T, the set { @(y)~1 v(y) | y € I’ } is not precompact. 


[Hint: yp may have an infinite kernel.] 
Prove Corollary 16.1.6 from Theorem 16.1.4. 


Show that the extension ¢ in Corollary 16.1.6 is unique. 
[Hint: Borel Density Theorem. ] 


In each case, find 
e a lattice [ in G and 
e a homomorphism y: T > G’, 
such that 
e (IL) is Zariski dense in G’, and 
e » does not extend to a continuous homomorphism ¢: G > G’. 
Also explain why they are not counterexamples to Corollary 16.1.6. 
a) G=G" = PSL(2,R) x PSL(2, R). 
b) G= PSL/(4, R) and G’ = SL(4,R). 
c) G=SO(2,3) and G’ = SO(2,3) x SO(5). 


Prove Corollary 16.1.7 from Theorem 16.1.4. 


Derive Theorem 16.1.4 from the combination of Corollary 16.1.6 and 
Corollary 16.1.7. (This is a converse to Exercises 6 and 9.) 


§16.2. Applications 


We briefly describe a few important consequences of the Margulis Superrigid- 
ity Theorem. 
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§16.2(i). Mostow Rigidity Theorem. The special case of the Mar- 
gulis Superrigidity Theorem (16.1.6) in which the homomorphism y is as- 
sumed to be an isomorphism onto a lattice I’ in G’ is very important: 
(16.2.1) Theorem (Mostow Rigidity Theorem, cf. (15.1.2)). Assume 

e G, and G2 are connected, with trivial center and no compact factors, 

e G,  PSL(2,R), 

e I; ts an irreducible lattice in G;, fori = 1,2, and 

e oy: Ty — [2 ts a group isomorphism. 

Then yp extends to a continuous isomorphism from Gy to Go. 

This theorem has already been discussed in Chapter 15. In most cases, 
it follows easily from the Margulis Superrigidity Theorem (see Exercise 1). 
However, since the superrigidity theorem does not apply when G, is either 


SO(1,m) or SU(1,m), a different argument is needed for those cases; see 
Section 15.2 for a sketch of the proof. 


§16.2(ii). Triviality of flat vector bundles over G/T. 


(16.2.2) Definition. For any homomorphism y: T — GL(n,R), there is a 
diagonal action of [ on G x R”, defined by 
(x, v) + = (27, 9(77")2)- 

Let €, = (G x R")/T be the space of orbits of this action. Then there is a 
well-defined map 

m: Ey + G/T, defined by x([z, v]) = aI, 
and this makes €,, into a vector bundle over G/T (with fiber R”) (see Exer- 
cise 3). A vector bundle defined from a homomorphism in this way is said to 
be a flat vector bundle. 


The Margulis Superrigidity Theorem implies (in some cases) that every 
flat vector bundle over G/T is nearly trivial. Here is an example: 


(16.2.3) Proposition. Let G=SL(n,R) and l = SL(n,Z). If Ey is any flat 
vector bundle over G/T, then there is a finite-index subgroup I’ of T, such 
that the lift of E, to the finite cover G/T” is trivial. 

In other words, if we let yp’ be the restriction of yp to I’, then the vector 
bundle E,, is isomorphic to the trivial vector bundle (G/T’) x R”. 


Proof. From Theorem 16.1.1, we may choose I” so that the restriction y’ 
extends to a homomorphism ¢: G + GL(n, R). Define a continuous function 
T: Gx R" >G x R” by 


T(g,v) = (9, A(g)v). 
Then, for any y € I”, a straightforward calculation shows 


T((g,v):7) =T(g,v) *y, where (g,v) * y = (97,2) (16.2.4) 
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(see Exercise 4). Therefore T factors through to a well-defined bundle iso- 
morphism Ey, = (G/T’) x R”. O 


§16.2(iii). Embeddings of locally symmetric spaces from embed- 
dings of lattices. Let M =I[\G/K and M' =I’\G'/K’. Roughly speaking, 
the Mostow Rigidity Theorem (15.1.2) tells us that if [ is isomorphic to I’, 
then M is isometric to M’. More generally, superrigidity implies that if T’ is 
isomorphic to a subgroup of I’, then M is isometric to a submanifold of M’ 
(modulo finite covers). 


(16.2.5) Proposition. Suppose 


e M =T\G/K and M = I’\G/K’' are irreducible locally symmetric 
spaces with no compact factors, 


e I is isomorphic to a subgroup of I’, and 


e the universal cover of M is neither the real hyperbolic space §)" nor the 
complex hyperbolic space CH”. 


Then some finite cover of [\G/K embeds as a totally geodesic submanifold 
of a finite cover of T’\G'/K’. 


Idea of proof. There is no harm in assuming that G and G’ have trivial 
center and no compact factors. After passing to a finite-index subgroup of I 
(and ignoring a compact group C), the Margulis Superrigidity Theorem tells 
us that the embedding [ <> I” extends to a continuous embedding y: G > 
G’. Conjugate » by an element of G’, so that y(K) C kK’, and y(G) is 
invariant under the Cartan involution of G’ corresponding to the maximal 
compact subgroup K’. Then y induces an embedding [\G/K — I’\G'/K' 
whose image is a totally geodesic submanifold. LJ 


Exercises for §16.2. 


#1. Prove the Mostow Rigidity Theorem (16.2.1) under the additional as- 
sumption that G1 is neither PSO(1,n) nor PSU(1,7). 


#2. The statement of the Mostow Rigidity Theorem in Theorem 15.1.2 is 
slightly different from Theorem 16.2.1. Show that these two theorems 
are corollaries of each other. 

[Hint: Exercise 15.2#1.] 


#3. In the notation of Definition 16.2.2: 
a) Show the map 7 is well defined. 
b) Show €, is a vector bundle over G/T with fiber R”. 


#4. Verify (16.2.4) for all y € I’. 
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§16.3. Why superrigidity implies arithmeticity 


Recall the following major theorem that was stated without proof in Theo- 
rem 5.2.1: 


(16.3.1) Margulis Arithmeticity Theorem. Every irreducible lattice in G 
is arithmetic, except, perhaps, when G is isogenous to SO(1,m) x K or 
SU(1,m) x K, for some compact group K. 


This important fact is a consequence of the Margulis Superrigidity Theo- 
rem, but the implication is not at all obvious. In this section, we will explain 
the main ideas that are involved. 

In addition to our usual assumption that G C SL(@, R), let us also assume, 
for simplicity: 


e G&SL(3,R) (or, more generally, G is algebraically simply connected; 
see Definition 16.1.2), and 


e G/T is not compact. 


We wish to show that IT is arithmetic. It suffices to show I C Gz, that is, 
that every matrix entry of every element of I is an integer, for then I is 
commensurable to G'z (see Exercise 4.1410). 

Here is a loose description of the 4 steps of the proof: 


1) The Margulis Superrigidity Theorem (16.1.1) implies that every matrix 
entry of every element of [ is an algebraic number. 


2) By restriction of scalars, we may assume that these algebraic numbers 
are rational; that is, f C Gg. 


3) For every prime p, a “p-adic” version of the Margulis Superrigidity 
Theorem provides a natural number N,, such that no element of I has 
a matrix entry whose denominator is divisible by p’?. 


4) This implies that some finite-index subgroup I” of [ is contained in Gz. 


Step 1. Every matrix entry of every element of IT is an algebraic number. 
Suppose some 7¥;,; is transcendental. Then, for any transcendental number a, 
there is a field automorphism ¢ of C with ¢(%,;) = a. Applying ¢ to all the 
entries of a matrix induces an automorphism ob of SL(¢,C). Let 


y be the restriction of ob to T, 


so y is a homomorphism from [I to SL(é,C). The Margulis Superrigidity 
Theorem implies there is a continuous homomorphism ¢: G — SL(@,C), such 
that ¢ = vy on a finite-index subgroup of T' (see Exercise 16.4442). By passing 
to this finite-index subgroup, we may assume @ = on all of T. 

Since there are uncountably many transcendental numbers a, there are 
uncountably many different choices of ¢, so there must be uncountably many 
different n-dimensional representations ¢ of G. However, it is well known 
from the the theory of “roots and weights” that G (or, more generally, any 
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connected, simple Lie group) has only finitely many non-isomorphic represen- 
tations of any given dimension, so this is a contradiction.® 


Step 2. We have T C SL(¢,Q). Let F be the subfield of C generated by the 
matrix entries of the elements of T, so T C SL(¢, F’). From Step 1, we know 
that this is an algebraic extension of Q. Furthermore, because I is finitely 
generated (see Theorem 4.7.10), we see that this field extension is finitely 
generated. Therefore, F' is finite-degree field extension of Q (in other words, 
F is an algebraic number field). This means that F’ is almost the same as Q, 
so it is only a slight exaggeration to say that we have proved [ C SL(Z, Q). 

Indeed, restriction of scalars (5.5.8) provides a way to change F' into Q: 
there is a representation p: G — SL(r,C), for some r, such that p(G ‘a 
SL(¢, F)) C SL(r,Q) (see Exercise 1). Therefore, after replacing G with 
p(G), we have the desired conclusion (without any exaggeration). 


Step 3. For every prime p, there is a natural number Np, such that no element 
of T has a matrix entry whose denominator is divisible by pN’. The fields 
R and C are complete (that is, every Cauchy sequence converges), and they 
obviously contain Q. For any prime p, the p-adic numbers Q, are another 
field that has these same properties. 

As we have stated it, the Margulis Superrigidity Theorem deals with ho- 
momorphisms into SL(¢, F), where F = R, but Margulis also proved a version 
of the theorem that applies when F is a p-adic field (see Theorem 16.3.2). Now 
G is connected, but p-adic fields are totally disconnected, so every continuous 
homomorphism from G to SL(é,Q,,) is trivial. Therefore, superrigidity tells 
us that y is trivial, after we mod out a compact group (cf. Theorem 16.1.4). 
In other words, the closure of y(I) is compact in SL(Z, Q,). 

This conclusion can be rephrased in more elementary terms, without any 
mention of p-adic numbers. Namely, it says that there is a bound on the 
highest power of p that divides the denominator of any matrix entry of any 
element of [. This is what we wanted. 


Step 4. Some finite-index subgroup I’ of T is contained in SL(¢,Z). Let 
D CN be the set consisting of the denominators of the matrix entries of the 
elements of (I). 

We claim there exists N € N, such that every element of D is less than N. 
Since [ is known to be finitely generated, some finite set of primes {p,,...,p,} 
contains all the prime factors of every element of D. (If p is in the denominator 
of some matrix entry of 7172, then it must appear in a denominator somewhere 


3 Actually, this is not quite a contradiction, because it is possible that two different 
choices of y yield the same representation of I, up to isomorphism; that is, after a change 
of basis. The trace of a matrix is independent of the basis, so the preceding argument really 
shows that the trace of y(y) must be algebraic, for every y € [. Then one can use some 
algebraic methods to construct some other matrix representation y’ of I, such that the 
matrix entries of y’(y) are algebraic, for every y €T. 
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in either 7 or 72.) Therefore, every element of D is of the form pj"! ---pi”", 
for some m1,...,™m, € N. From Step 3, we know m; < Np,, for every i. Thus, 


every element of D is less than py? tee per. This establishes the claim. 

From the preceding paragraph, we see that I C 3; Mate (Z). Note that 
if N = 1, then  C SL(¢,Z). In general, N is a finite distance from 1, so it 
should not be hard to believe (and it can indeed be shown) that some finite- 
index subgroup of I must be contained in SL(@, Z) (see Exercise 2). Therefore, 
a finite-index subgroup of I is contained in Gz, as desired. 


For ease of reference, we officially record the key fact used in Step 3: 


(16.3.2) Theorem (Margulis superrigidity over p-adic fields). Assume 
i) G is not isogenous to any group that is of the form SO(1,m) x K or 
SU(1,m) x K, where K is compact, 
ii) Tis irreducible, 
iii) Q, is the field of p-adic numbers, for some prime p, and 
iv) yp: FT > GL(n,Q,) is a homomorphism. 


Then y(L) is compact. 

In other words, there is some N € Z, such that every matrix entry of 
every element of (I) is in pN Zy, where Zp is the ring of p-adic integers. 

The Margulis Arithmeticity Theorem (16.3.1) does not apply to lattices 
in SO(1,n) or SU(1,n), but, for those groups, Margulis proved the following 
characterization of the lattices that are arithmetic: 

(16.3.3) Commensurability Criterion for Arithmeticity (Margulis). As- 
sume 

e G is connected, with no compact factors, and 

e [ts trreducible. 

Then I is arithmetic if and only if the commensurator Commg(l) of T is 
dense in G. 

As was already mentioned in Remark 5.2.5(1), the direction (=) follows 
from the simple observation that Commg(Gz) contains Gog. 

The proof of (<) is more difficult. It is the same as the proof of the 
Margulis Arithmeticity Theorem, but replacing the Margulis Superrigidity 
Theorem (16.1.6) with the following superrigidity theorem (and also replacing 
the p-adic superrigidity theorem with a suitable commensurator analogue): 


(16.3.4) Theorem (Commensurator Superrigidity). Assume 
i) T ts irreducible, 
ii) Commag(T) is dense in G, and 
iii) G and G’ are connected, with trivial center, and no compact factors. 


If p: Commg(T) > G' is any homomorphism whose image is Zariski dense 
in G', then p extends to a continuous homomorphism @: G > G’. 
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Exercises for 816.3. 


#1. Suppose 
e GCSL(Z¢,C), 
e I CSL(¢, F), for some algebraic number field F’, and 
e G has no compact factors. 
Show there is a continuous homomorphism p: G — SL(r,C), for some r, 
such that p(GMSL(é, F)) C SL(r, Q). 


[Hint: Apply restriction of scalars (§5.5) after noting that the Borel Density 
Theorem (§4.5) implies G is defined over F’.] 


#2. Show that if A is a subgroup of SL(@,Q), and A C x Matexe(Z), for 
some N EN, then SL(¢, Z) contains a finite-index subgroup of A. 
[Hint: The additive group of Q’ contains a A-invariant subgroup V, such 
that we have Z‘ CV C 4Z‘. Choose g € GL(¢,Q), such that g(V) = Z°. 
Then g commensurates SL(é,Z) and we have gAg~* C SL(é, Z).] 


#3. Assume, as usual, that 
e G is not isogenous to any group that is of the form SO(1,m) x K 
or SU(1,m) x K, where K is compact, and 
e I is irreducible. 
Use the proof of the Margulis Arithmeticity Theorem to show that if 
yp: T > SL(n, C) is any homomorphism, then every eigenvalue of every 
element of y(L) is an algebraic integer. 


§16.4. Homomorphisms into compact groups 


The Margulis Superrigidity Theorem (16.1.4) does not say anything about ho- 
momorphisms whose image is contained in a compact subgroup of GL(n, R). 
(This is because all of y([) can be put into the error term C’, so the ho- 
momorphism @ can be taken to be trivial.) Fortunately, there is a different 
version that completely eliminates the error term (and applies very gener- 
ally). Namely, from the Margulis Arithmeticity Theorem (5.2.1), we know 
that the lattice [ must be arithmetic (if no simple factors of G are SO(1,m) 
or SU(1,m)). This means that if we add some compact factors to G, then 
we can assume that I is commensurable to Gz. In this situation, there is no 
need for the error term C’: 


(16.4.1) Corollary. Assume 


e G is connected, algebraically simply connected, and defined over Q, 


e G is not isogenous to any group that is of the form SO(1,m) x K or 
SU(1,m) x K, where K is compact, 


e I is irreducible, and 


e vy: TF > GL(n,R) is a homomorphism. 
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If T is commensurable to Gz, then there exist: 
1) a continuous homomorphism @: G —+ GL(n,R), and 
2) a finite-index subgroup I’ of IT, 
such that p(y) = ¢(7), for ally eT". 
Here is a less precise version of Corollary 16.4.1 that may be easier to ap- 
ply in situations where the lattice [is not explicitly given as the Z-points of G. 


However, it only applies to the homomorphism into each simple component 
of a([), not to the entire homomorphism all at once. 


(16.4.2) Corollary. Assume 
e G is connected, and algebraically simply connected, 


e G is not isogenous to any group that is of the form SO(1,m) x K or 
SU(1,m) x K, where K is compact, 


I is trreducable, 


yp: T > GL(n, C) is a homomorphism, and 
e y(I) is simple. 
Then there exist: 
1) a continuous homomorphism @: G + GL(n,C), 
2) a finite-index subgroup I’ of T, and 
3) a Galois automorphism o of C, 
such that p(y) = o(G(7)), for all y EI”. 


Proof. We may assume y(I) is compact, for otherwise Corollary 16.1.6 ap- 
plies (after modding out the centers of G and a(I)). Then every element of 
p(L) is semisimple. 

Choose some h € y(I’), such that h has infinite order (see Exercise 4.8412). 
Then the conclusion of the preceding paragraph implies that some eigen- 
value A of h is not a root of unity. On the other hand, if is algebraic, then 
p-adic superrigidity (16.3.2) implies that » is an algebraic integer (see Exer- 
cise 16.33). So there is a Galois automorphism o of C, such that |o(A)| 4 1 
(see Exercise 4). Then {o(A)* | k € Z} is an unbounded subset of C, so 
(a(h)) is not contained in any compact subgroup of GL(n, C). 

Now, let 


e y’ be the composition a o y, and 
e G’ be the Zariski closure of y’(L). 


Then G’ is simple, and the conclusion of the preceding paragraph implies that 
G’ is not compact (since o(h) € G’). After passing to a finite-index subgroup 
(so G’ is connected), Corollary 16.1.6 provides a continuous homomorphism 
p: G > G’, such that y’(y) = 97), for all y in some finite-index subgroup 
of [. 
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(16.4.3) Warning. Assume [ is irreducible, and G is not isogenous to any 
group of the form SO(1,m) x K or SU(1,m) x K. Corollary 16.4.2 implies that 
if there exists a homomorphism y from [' to a compact Lie group (and y(TL) is 
infinite), then G/T’ must be compact (see Exercise 1). However, the converse 
is not true. Namely, Corollary 16.4.1 tells us that if [ is commensurable to 
Gz, where G is defined over Q, and Gp has no compact factors, then T does 
not have any homomorphisms to compact groups (with infinite image). It 
does not matter whether G/T is compact or not. 


Exercises for §16.4. 


iy 


#2. 


#3. 


4A, 


Assume, as usual, that the lattice I’ is irreducible, that G is not isoge- 
nous to any group of the form SO(1,m) x K or SU(1,m) x K, and that 
y: T — GL(n,R) is a homomorphism. If G/T is not compact, show 


the semisimple group y(I) has no compact factors. 
[Hint: Godement’s Criterion (5.3.1).] 


Assume 
e G is algebraically simply connected, 
e G is not isogenous to any group that is of the form SO(1,m) x K 
or SU(1,m) x K, where K is compact, 
e I is irreducible, 
e G/T is not compact, and 
e y: T > SL(n,R) is a homomorphism. 
Show there is a continuous homomorphism ¢: G — SL(n,R), such that 
v(y) = (7) for all y in some finite-index subgroup of I. 
[Hint: Theorem 16.1.4, Corollary 16.1.7, and Exercise 1.] 


Assume I is irreducible, and G has no factors isogenous to SO(1,m) or 
SU(1,m). Show that if N is an infinite normal subgroup of I’, such that 
I'/N is linear (i.e., isomorphic to a subgroup of GL(¢,C), for some £), 
then ['/N is finite. 


(Kronecker’s Theorem) Assume is an algebraic integer. Show that if 
|o(A)| = 1 for every Galois automorphism o of C, then X is a root of 
unity. 

[Hint: The powers of X form a set that (by restriction of scalars) is discrete 


in Xoeg0 FS. Alternate proof: there are only finitely many polynomials of 
degree n with integer coefficients that are all < C in absolute value.] 


§16.5. Proof of the Margulis Superrigidity Theorem 


In order to establish Corollary 16.1.6, it suffices to prove the following special 
case (see Exercise 1): 
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(16.5.1) Theorem. Suppose 


e G is connected, and it is not isogenous to any group that is of the form 
SO(1,m) x K or SU(1,m) x K, where K is compact, 


e the lattice T’ is irreducible in G, 


e H is a connected, noncompact, simple subgroup of SL(n,R), for some n 
(and H has trivial center), 


e oy: Tl > A is a homomorphism, and 
e o(I) ts Zariski dense in H. 


Then vy extends to a continuous homomorphism @: G > H. 
Although it does result in some loss of generality, we assume: 
(16.5.2) Assumption. rankp G > 2. 


The case where rankg G = 1 requires quite different methods — see Sec- 
tion 16.8 for a very brief discussion. 


§16.5(i). Geometric reformulation. To set up the proof of Theo- 
rem 16.5.1, let us translate the problem into a geometric setting, by replacing 
the homomorphism y with the corresponding flat vector bundle €,, over G/T 
(see Definition 16.2.2). 


(16.5.3) Remark. The sections of the vector bundle €, are in natural one-to- 
one correspondence with the right [-equivariant maps from G to R” (see Ex- 
ercise 2). 


(16.5.4) Lemma. y extends to a homomorphism ¢: G + GL(n, R) if and only 
if there exists a G-invariant subspace V C Sect(E,), such that the evaluation 
map V — Vie is bijective. 


Proof. (<) Since V is G-invariant, we have a representation of G on V; let 
us say 7: G — GL(V). Therefore, the isomorphism V — V;.} = R” yields 
a representation 7 of G on R”. It is not difficult to verify that 7 extends y 
(see Exercise 3). 

(=) For v € R” and g € G, let 


Ex(g) = P(g"). 
It is easy to verify that £,: G — R” is right T-equivariant (see Exercise 4), so 
we may think of €, as a section of €, (see Remark 16.5.3). Let 
V={& |v €R"} C Sect(E,). 
Now the map vu +> , is linear and G-equivariant (see Exercise 5), so V is a 
G-invariant subspace of Sect(€,). Since 
Ev ([e]) = Gle)v =», 


it is obvious that the evaluation map is bijective. 
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In fact, if we assume the representation y is irreducible, then it is not 
necessary to have the evaluation map V — V,, be bijective. Namely, in 
order to show that y extends, it suffices to have V be finite dimensional (and 
nonzero): 


(16.5.5) Lemma (see Exercise 6). Assume that the representation p is ir- 
reducible. If there exists a (nontrivial) G-invariant subspace V of Sect(E.,) 
that is finite dimensional, then yp extends to a continuous homomorphism 
ep: G> GL(n,R). 


§16.5(ii). The need for higher real rank. We now explain how As- 
sumption 16.5.2 comes into play. 


(16.5.6) Notation. Let A be a maximal R-split torus of G. For example, if 
G = SL(8, R), we let 
* 00 
A= ) * i , 


00x 
By definition, the assumption that rankg G > 2 means dim A > 2. 


It is the following result that relies on our assumption rankpG > 2. It 
is easy to prove if G has more than one noncompact simple factor (see Exer- 
cise 7), and is not difficult to verify for the case G = SL(¢, R) (cf. Exercise 8). 
Readers familiar with the structure of semisimple groups (including the the- 
ory of real roots) should have little difficulty in generalizing to any semisimple 
group of real rank > 2 (see Exercise 9). 


(16.5.7) Lemma. Jf rankg G > 2, then, for some r € N, there exist closed 
subgroups Ly, Lo,...,L, of G, such that 
1) G= Ly Ly} a “Ty, and 
2) both H; and H;} are noncompact, where 
e A, = L,0A4, and 
e H+ =Ca(L;) (so L; centralizes H;). 


§16.5(iii). Outline of the proof. The idea for proving Theorem 16.5.1 
is quite simple. We begin by finding a (nonzero) A-invariant section of €,; this 
section spans a (1-dimensional) subspace Vo of €, that is invariant under A. 
Since (by definition) the subgroup H, of Lemma 16.5.7 is contained in A, 
we know that Vo is invariant under H;, so Lemma 16.5.8 below provides a 
subspace of Sect(€,) that is invariant under a larger subgroup of G, but is still 
finite dimensional. Applying the lemma repeatedly yields finite-dimensional 
subspaces that are invariant under more and more of G. Eventually, the 
lemma yields a finite-dimensional subspace that is invariant under all of G. 
Then Lemma 16.5.5 implies that y extends to a homomorphism that is defined 
on G, as desired. 
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(16.5.8) Lemma. [f 

e Hf is a closed, noncompact subgroup of A, and 

e V is an H-invariant subspace of Sect(E,,) that is finite dimensional, 
then (Ca(H)-V) is finite dimensional. 


Idea of proof. To illustrate the idea of the proof, let us assume that V = Ro 
is the span of an H-invariant section (see Exercise 13). Since H is noncompact, 
the Moore Ergodicity Theorem (14.2.4) tells us that H has a dense orbit on 
G/T (see Exercise 14.2#16). (In fact, almost every orbit is dense.) This 
implies that any continuous H-invariant section of €, is determined by its 
value at a single point (see Exercise 10), so the space of H-invariant sections 
is finite-dimensional (see Exercise 11). Since this space contains (Cg(H) - V) 
(see Exercise 12), the desired conclusion is immediate. 


Here is a more detailed outline: 


Idea of the proof of Theorem 16.5.1. Assume there exists a nonzero 
A-invariant section o of E,. Let 
Ho = A and VY = (0). 

Thus, Vo is a 1-dimensional subspace of Sect(€,,) that is Ho-invariant. 

Now, for i = 1,...,7r, let 

VY, = (L,-A-Djy-1-A---L,-A-Vo). 

Since L,L,_1---L, = G, it is clear that V, is G-invariant. Therefore, it will 
suffice to show (by induction on 7) that each V; is finite dimensional. 

Since H;_1 C L,_1, it is clear that V;_, is H;_1-invariant. Therefore, since 
A centralizes H;-,, Lemma 16.5.8 implies that (A-V;_1) is finite dimensional. 
Now, since H C A, we know that (A-Vj_1) is Hj-invariant. Then, since L; 
centralizes H;+, Lemma 16.5.8 implies that the subspace V; = (L;- A - Vi-1) 
is finite dimensional. 


Therefore, the key to proving Theorem 16.5.1 is finding a nonzero A- 
invariant section o of €,. Unfortunately, the situation is a bit more compli- 
cated than the above would indicate, because we will not find a continuous 
A-invariant section, but only a measurable one (see Key Fact 16.6.1). Then 
the proof appeals to Lemma 16.5.10 below, instead of Lemma 16.5.8. We 
leave the details to the reader (see Exercise 16). 


(16.5.9) Definition. Let Sectmeas(E,) be the vector space of measurable sec- 
tions of €,, where two sections are identified if they agree almost everywhere. 
(16.5.10) Lemma (see Exercises 14 and 15). If 

e 7 is a closed, noncompact subgroup of A, and 

e V is a finite-dimensional, H-invariant subspace of Sectmeas(Ey), 
then (Cqa(H)-V) is finite dimensional. 


336 


16. MARGULIS SUPERRIGIDITY THEOREM 


Exercises for 816.5. 


#1 
a) 


£3. 


4A, 


$9. 


#6. 


#7. 


#8. 


#9, 
4£10. 


#11. 


#12. 


Derive Corollary 16.1.6 as a corollary of Theorem 16.5.1. 
Suppose €: G + R”. Show that €: G/T > Eg, defined by 
E(gV) = [(9,€(9))], 


is a well-defined section of €, if and only if € is right I'-equivariant; i.e., 


E(gy) = v(y~") &(g)- 


In the notation of the proof of Lemma 16.5.4(<), show 7(y) = (7) 
for every y ET. 


In the notation of the proof of Lemma 16.5.4(=), show that we have 


Ev(gh) = G(h~*) Ev(g). Since G(y~*) = y(y~") for all y € T, this 
implies that €, is right ['-equivariant. 


In the notation of the proof of Lemma 16.5.4(=-), show that we have 
Es(g)v = 9° &v, where the action of G on Sect(€,) is defined by (g - 


Ey) (x) = Ey(g7'z), as usual. 


Prove Lemma 16.5.5. 


[Hint: By choosing V of minimal dimension, we may assume it is an irre- 
ducible G-module, so the evaluation map is either 0 or injective. It cannot 
be 0, and then it must also be surjective, since y is irreducible.| 


Prove Lemma 16.5.7 under that additional assumption that we have 
G = G, X Gg, where G; and G2 are noncompact (and semisimple). 
[Hint: Let L; = G; for i = 1, 2.] 


Prove the conclusion of Lemma 16.5.7 for G = SL(3, R). 


[Hint: A unipotent elementary matrix is a matrix with 1’s on the di- 
agonal and only one nonzero off-diagonal entry. Every element of SL(3,R) 
is a product of < 10 unipotent elementary matrices, and any such matrix 
is contained in a subgroup isogenous to SL(2,R) that has a 1-dimensional 
intersection with A.] 


Prove Lemma 16.5.7. 


Let H be a subgroup of G. Show that if 0, and o2 are H-invariant, 
continuous sections of €,, and there is some x € G/T, such that 

e Hz is dense in G/T and 

e 01 (x) = 92(z), 
then 01 = 0». 


Let H be a subgroup of G, and assume H has a dense orbit in G/T. 
Show the space of H-invariant, continuous sections of €, has finite 
dimension. 


Let H be a subgroup of G. Show that if o is an H-invariant section 
of €,, and c is an element of G that centralizes H, then o -c is also 
H-invariant. 
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#13. Prove Lemma 16.5.8 without assuming that V is 1-dimensional. 
[Hint: Fix « € G. For c € Ccg(H) and o € V, define T: V > R” by 
T(é) = €(), and note that (co)(hz) = T(h~' +c) for allh € H. If Hal is 
dense, this implies that co is determined by o and T. So dim(Ca(H)-V) < 
(dim V) - (dim Hom(V, R”)).] 


#14. Prove Lemma 16.5.10 in the special case where V = Ro is the span of 
an H-invariant measurable section. 
[Hint: This is similar to Lemma 16.5.8, but use the fact that H is ergodic on 
G/T. 

#15. Prove Lemma 16.5.10 (without assuming dim V = 1). 
[Hint: This is similar to Exercise 13.] 


#16. Prove Theorem 16.5.1. 


§16.6. An A-invariant section 


This section sketches the proof of the following result, which completes the 
proof of Theorem 16.5.1 (under the assumption that rankp G > 2). 


(16.6.1) Key Fact. For some n, there is an embedding of H in SL(n,R), 
such that 
1) the associated representation py: T + H C SL(n,R) ts irreducible, and 


2) there exists a nonzero A-invariant o € Sectmeas(Ey)- 
Remark 16.5.3 allows us to restate this as follows: 


(16.6.1') Key Fact. For some embedding of H in SL(n,R), 
1) A acts irreducibly on R”, and 


2) there exists a T’-equivariant, measurable function €: G/A — R” (and 
€ is nonzero). 


In this form, the result is closely related to the following consequence of 
amenability (from Chapter 12). For simplicity, it is stated only for the case 
G = SL(8,R). 

(12.6.2’) Proposition (Furstenberg). Jf 

e G=SL(3,R), 


* kK 
e P=} xx| CG, and 
* 
e T acts continuously on a compact metric space X , 


then there is a Borel measurable map W: G/P — Prob(X), such that w is 
essentially T-equivariant. 


For convenience, let W = R”. There are 3 steps in the proof of Key 
Fact 16.6.1’: 
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1) (amenability) Letting X be the projective space P(W), which is com- 
pact, Proposition 12.6.2’ provides a T-equivariant, measurable map 
€: G/P > Prob(P(W)). 


2) (proximality) The representation of [ on W induces a representation 


of T on any exterior power he W. By replacing W with an appro- 
priate subspace of such an exterior power, we may assume there is 
some y € I, such that y has a unique eigenvalue of maximal absolute 
value (see Exercise 1). Therefore, the action of 7 on P(W) is “prox- 
imal” (see Lemma 16.7.3). The theory of proximality (discussed in 


Section 16.7) now tells us that the [-equivariant random map € must 
actually be a well-defined map into P(W) (see Corollary 16.7.10). 


3) (algebra trick) We have a I-equivariant map é: G/P — P(W). By the 
same argument, there is a T-equivariant map ¢*: G/P — P(W*), where 
W* is the dual of W. Combining these yields a I-equivariant map 

E:G/P x G/P > P(W @W*)& P(End(W)). 
We can lift € to a well-defined map 
£: G/P x G/P > End(W), 

by specifying that trace (€ (x)) = | (see Exercise 2). Since the action of [ 
on End(W) is by conjugation (see Exercise 3) and the trace of conjugate 
matrices are equal, we see that € is [-equivariant (see Exercise 4). 

Finally, note that there is a G-orbit in G/P x G/P whose comple- 
ment is a set of measure 0, and the stabilizer of a point is (conjugate to) 
the group A of diagonal matrices (see Exercises 5 and 6). Therefore, 


after discarding a set of measure 0, we may identify G/P x G/P with 
G/A, so €: G/A — End(W). 


Exercises for 816.6. 


#1. Let 
e » be a semisimple element of I’, such that some eigenvalue of ¥ is 
not of absolute value 1. 
e X1,...,Ax be the eigenvalues of y (with multiplicity) that have 
maximal absolute value. 
eW=/\'w. 
Show that, in the representation of [ on W’, the element 7 has a unique 
eigenvalue of maximal absolute value. 


#2. Let €: G/P — W and é*: G/P — W* be well-defined, measurable lifts 
of & and &* 
a) Show, for ae. x,y € G/P, that E(x) is not in the kernel of the 
linear functional €*(y). 


16.7. A QUICK LOOK AT PROXIMALITY 339 


b) Show, for a.e. x,y € G/P, that, under the natural identification of 
W ®W* with End(W), we have 


trace(E(x) @ E*(y)) #0. 
c) Show € can be lifted to a well-defined measurable function €: G'/P x 
G/P — End(W), such that trace(€(x,y)) = 1, for ae. x,y € G/P. 
[Hint: T acts irreducibly on W, and ergodically on G/P x G/P.] 


#3. Show that the action of T on End(W) = W @ W* is given by conjuga- 
tion: B()T = ey) T oly). 
#4. Show that € is T-equivariant. 


#5. Recall that a flag in R® is a pair (¢, II), where 
e ¢ is a line through the origin (in other words, a 1-dimensional 
linear subspace), and 
e II is a plane through the origin (in other words, a 2-dimensional 
linear subspace), such that 
efcll. 
Show: 
a) SL(3,R) acts transitively on the set of all flags in R°, and 
b) the stabilizer of any flag is conjugate to the subgroup P of Propo- 
sition 12.6.2’. 
Therefore, the set of flags can be identified with G/P. 


#6. Two flags (¢1,11,) and (¢2, 2) are in general position if 
Ly ¢ IIs, and by ¢ Tl, a 

Letting G be the subset of G/P x G/P corresponding to the pairs of 
flags that are in general position, show: 

a) SL(3,R) is transitive on G, 

b) the stabilizer of any point in G is conjugate to the group of diagonal 

matrices, and 
c) the complement of G has measure zero in G/P x G/P. 


[Hint: For (a) and (b), identify G with the set of triples (¢1, 22, @3) of lines 
that are in general position, by letting 23 = 11 N H2.] 


§16.7. A quick look at proximality 


(16.7.1) Assumption. Assume 
1) Tc SL(Z,R), 
2) every finite-index subgroup of I is irreducible on R‘, and 


3) there exists a semisimple element 7 € I, such that 7 has a unique 
eigenvalue \ of maximal absolute value (and the eigenvalue is simple, 
which means the corresponding eigenspace is 1-dimensional). 


(16.7.2) Notation. 
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1) Let 0 be an eigenvector associated to the eigenvalue \. 


2) For convenience, let W = R*. 


(16.7.3) Lemma (Proximality). The action of T on P(W) is proximal. This 
means that, for every [wi], |\w2] € P(W), there exists a sequence {yn} in T, 
such that d([Y¥n(w1)], [Yn(we)]) + 0 as n > ov. 


Proof. Assume, to simplify the notation, that all of the eigenspaces of ¥ are 
orthogonal to each other. Then, for any w € W\ 3+, we have 7"[w] > [0], as 
n — oo (see Exercise 1). Since the finite-index subgroups of I act irreducibly, 
there is some y € I’, such that y(w1), y(w2) ¢ 0+ (see Exercise 2). Therefore, 


d(7”"y((wi]), ¥"y([we])) > d([a], [o]) = 0, 
as desired. Og 


In the above proof, it is easy to see that the convergence ¥”|w] — [0] is 
uniform on compact subsets of W \ w+ (see Exercise 3). This leads to the 
following stronger assertion (see Exercise 4): 


(16.7.4) Proposition (Measure proximality). Let be any probability mea- 
sure on P(W). Then there is a sequence {y,} in IT, such that (yn)xp converges 
to a delta-mass supported at a single point of P(W). 


It is obvious from Proposition 16.7.4 that there is no T-invariant prob- 
ability measure on P(W). However, it is easy to see that there does exist 
a probability measure that is invariant “on average? in the following sense 
(see Exercise 5): 


(16.7.5) Definition. 


1) Fix a finite generating set S of [, such that S~! = S$. A probability 
measure 1 on P(W) is stationary for S if 


a5 = 


yES 


2) More generally, let v be a probability measure on T. A probability 
measure jt on P(W) is v-stationary if v* 4 = pu. More concretely, this 
means 

S- u(y) Yet = b 

ver 
(Some authors call ys “harmonic? rather than “stationary” ) 
(16.7.6) Remark. A random walk on P(W) can be defined as follows: Choose 
a sequence 71, 72,-..- of elements of [’, independently and with distribution v. 


Also choose a random 29 € P(W), with respect to some probability distribu- 
tion w on P(W). Then xz, € P(W) is defined by 


Ln = V12°-°* Yn(Zo), 
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so {xz,} is a random walk on P(W). A stationary measure represents a “sta- 
tionary state” (or equilibrium distribution) for this random walk. Hence the 
terminology. 


If the initial distribution jz is stationary, then a basic result of probability 
(the “Martingale Convergence Theorem” ) implies, for almost every sequence 
{Yn}, that the resulting random walk {z,,} has a limiting distribution; that 
is, 


for a.e. {Yn}, (V172°++ Yn)«H converges in Prob(P(W)). 


This theorem applies to stationary measures on any space, with no need for 
Assumption 16.7.1. By using measure proximality, we will now show that the 
limiting distribution is almost always a point mass. 


(16.7.7) Definition. A closed, nonempty, [-invariant subset of P(W) is min- 
imal if it does not have any nonempty, proper, closed, T-invariant subsets. 
(Since P(W) is compact, the finite-intersection property implies that every 
nonempty, closed, T-invariant subset of P(W) contains a minimal set.) 


(16.7.8) Theorem (Mean proximality). Assume 
e vy is a probability measure on T, such that v(y) > 0 for all y €T, 


e C is a minimal closed, T-invariant subset of P(W), and 


e yu is av-stationary probability measure on C. 
Then, for a.e. {yn} € T°, there exists ce P(W), such that 
(Y17¥2°°* Yn) «("b) 4 de as N > OO. 


Proof. It was mentioned above that the Martingale Convergence Theorem 
implies (y172--:Yn)«(14) has a limit (almost surely), so it suffices to show there 
is (almost surely) a subsequence that converges to a measure of the form 6,.. 

Proposition 16.7.4 provides a sequence {g,} of elements of T', such that 
(Gk )xl — Oc, for some co € C. To extend this conclusion to a.e. sequence 
{yn}, we use equicontinuity: we may write I is the union of finitely many 
sets £1,...,&,, such that each FE; is equicontinuous on some nonempty open 
subset U; of C (see Exercise 6). 

The minimality of C implies [U; = C for every 7. Then, by compactness, 
there is a finite subset F = {f1,...,f,} of [, such that FU; = C for each i. 
Since v(y) > 0 for every y € TI, there is (almost surely) a subsequence {7n, } 
of {yn}, such that, for every k, we have 


YrrtlYnpt2°°° Yne+j = pe gx forl<j<s. 
By passing to a subsequence, we may assume there is some 2, such that 
V172°** Yn, © Ei, for all k. 


To simplify the notation, let us assume 7 = 1. 
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Since FU; = C, we may write co = f;u, for some f; € F and u € Uj. 
Then 
(Yna tmnt? meta lel = (F5* Ge)aY 4 (FF ') x50 = Spm tog = Su 


By passing to a subsequence, we may assume (7172°-:Yn,)U converges to 
some c € C’. Then, since 7172°-- Yn, € £1, and Ej is equicontinuous on Uj, 
this implies 


(172° ++ Yneta)aY = (172° Ine) (Cnet sal Yaxty)ev) — de. OU 


In order to apply this theorem, we need a technical result, whose proof 
we omit: 
(16.7.9) Lemma. There exist: 

e a probability measure v on T, and 

e av-stationary probability measure w on G/P, 
such that 

1) the support of v generates T, and 

2) pu is in the class of Lebesgue measure. (That is, 4 has exactly the same 


sets of measure 0 as Lebesgue measure does.) 


Also note that if C is any nonempty, closed, T-invariant subset of P(W), 
then Prob(C) is a nonempty, compact, convex T-space, so Furstenberg’s 
Lemma (12.6.1) provides a T-equivariant map €: G/P — Prob(C). This 
observation allows us to replace P(W) with a minimal subset. 

We can now fill in the missing part of the proof of Key Fact 16.6.1’: 


(16.7.10) Corollary. Suppose 
e C is a minimal closed, T-invariant subset of P(W), and 
e €: G/P > Prob(C) is -equivariant. 
Then E(a) is a point mass, for a.e. & € G/P. 
Hence, there exists €: G/P — P(W), such that €(x) = Dé (my for a.e. 
xeEG/P. 
Proof. Let 
e dp(w) = {52 | « € P(W) } be the set of all point masses in the space 
Prob(P(W)), and 


e 4 be a v-stationary probability measure on G'/P that is in the class of 
Lebesgue measure (see Lemma 16.7.9). 


We wish to show €(x) € dpcw), for a.e. « € G/P. In other words, we wish to 


show that €,(j) is supported on Op(w)- 
Note that: 


e dp(w) is a closed, T-invariant subset of Prob(P(W)), and 
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e because € is I-equivariant, we know that €,() is a v-stationary prob- 
ability measure on Prob(P(W)). 

Roughly speaking, the idea of the proof is that almost every trajectory of 

the random walk on Prob(P(W)) converges to a point in dpcy) (see 16.7.8). 


On the other hand, being stationary, €,(j) is invariant under the random 
walk. Therefore, we conclude that €,(j:) is supported on dpry), as desired. 
We now make this rigorous. Let 


ene iE phe) nla) 


SO JWp(w) is a stationary probability measure on P(W). By mean proximality 
(16.7.8), we know, for a.e. (71, 72,.--) € 1°, that 

d((7172°++Yn)«(Hew)), Sew) ) “FP 0. 
For any € > 0, this implies, by using the definition of jupry), that 


u({2€ G/P | d(nre---nE(2)), dey) >€}) “PF 0. 
Since € is I-equivariant, we may 


replace 7172°-+Yn(E(x)) with €(y172---In2). 

Then, since the measure on G/P is stationary, we can delete y172°--Yn, 
and conclude that 

u{xeG/P| d(&(x), daw) >e} “—F 0 (16.7.11) 

(see Exercise 7). Since the left-hand side does not depend on n, but tends 

to 0 as n — o, it must be 0. Since € > 0 is arbitrary, we conclude that 

E(x) € dp(w) for a.e. x, as desired. 


Exercises for §16.7. 


#1. In the notation of Lemma 16.7.3, show, for every w € W \ 3+, that 
7" |w] > [v], as n > oo. 


#2. Show, for any nonzero w 1, w2 € W, that there exists y € I, such that 
neither yw, nor yw is orthogonal to V. 


[Hint: Let H be the Zariski closure of T in SL(é,R), and assume, by passing 
to a finite-index subgroup, that H is connected. Then W; = {h € H | hw; € 


v~ } is a proper, Zariski-closed subset. Since I is Zariski dense in H, it must 
intersection the complement of W; U W2.] 


#3. Show that the convergence in Exercise 1 is uniform on compact subsets 
of W\ ot. 


#4. Prove Proposition 16.7.4. 
[Hint: Show MaX,cp(w)veFa v(w) = 11] 


#5. Show there exists a stationary probability measure on P(W). 
(Hint: Kakutani-Markov Fixed-Point Theorem (cf. 12.2.1).] 
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#6. Let C be a subset of P(IR”), and assume that C is not contained in any 

(n — 1)-dimensional hyperplane. Prove that GL(n,R) is the union of 
finitely many sets E),...,&,, such that each FE; is equicontinuous on 
some nonempty open subset U; of C. 
[Hint: Each matrix T € Matnxn(R) induces a well-defined, continuous func- 
tion T: (P(R”) \ P(ker T)) — P(R”). If Br is a small ball around T in 
P(Matnxn(R)), then Br is equicontinuous on an open set. A compact set 
can be covered by finitely many balls.] 


#7. Establish (16.7.11). 
[Hint: Since p is stationary, the map 
Tr x G/P + G/P: ((715%2)-+-), 0)  W172°° Ine 


is measure preserving. | 


#8. Show that if P(W) is minimal, then the [-equivariant measurable map 
€: G/P — P(W) is unique (a.e.). 
[Hint: If w is another -equivariant map, then define €: G/P — Prob (P(W)) 
by €() = 3 (dee) + du(e)): 


§16.8. Groups of real rank one 


The Margulis Superrigidity Theorem (16.1.4) was proved for groups of real 
rank at least two in Section 16.5. Suppose, now, that rankg G = 1 (and G 
has no compact factors). The classification of simple Lie groups tells us that 
G is isogenous to the isometry group of either: 

e real hyperbolic space 9”, 

e complex hyperbolic space C9”, 


e quaternionic hyperbolic space HS)”, or 


e the Cayley hyperbolic plane OS)? (where O is the ring of “Cayley num- 
bers” or “octonions” ) 


(cf. Theorem 8.3.1). Assumption 16.1.4(ii) rules out §)” and CS)”, so, from 
the connection of superrigidity with totally geodesic embeddings (cf. Subsec- 
tion 16.2(iii)), the following result completes the proof: 
(16.8.1) Theorem. Assume 

e X = Hh” or ON?, 

e I is a torsion-free, discrete group of isometries of X, such that T\X 

has finite volume, 

e X’ is an irreducible symmetric space of noncompact type, and 

e »: T > Isom(X’)° is a homomorphism whose image is Zariski dense. 
Then there is a map f: X — X', such that 

1) f(X) is totally geodesic, and 

2) f is p-equivariant, which means f (yx) = p(y): f(x). 
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Brief outline of proof. Choose a (nice) fundamental domain F for the 
action of [ on X. For any y-equivariant map f: X — X’, define the energy 
of f to be the L?-norm of the derivative of f over F. Since f is y-equivariant, 
and the groups [' and y(I) act by isometries, this is independent of the choice 
of the fundamental domain F. 

It can be shown that this energy functional attains its minimum at some 
function f. The minimality implies that f is harmonic. Then, by using the 
geometry of X and the negative curvature of X’, it can be shown that f must 
be totally geodesic. 


Notes 


This chapter is largely based on |[6, Chaps. 6 and 7]. (However, we usually 
replace the assumption that rankp G > 2 with the weaker assumption that G 
is not SO(1,m) x K or SU(1,m) x K. (See [10, Thm. 5.1.2, p. 86] for a different 
exposition that proves version (16.1.6) for rankg G > 2.) In particular: 

e For ranky G > 2, our statement of the Margulis Superrigidity Theorem 

(16.1.6) is a special case of [6, Thm. 7.5.6, p. 228]. 

e For rankg G > 2, Corollary 16.1.7 is stated in [6, Thm. 9.6.15(i)(a), 

pease 

e For rankgG > 2, Theorem 15.1.2 is stated in [6, Thm. 7.7.5, p. 254]. 

(See [8, Thm. B] for the general case, which does not follow from su- 
perrigidity. ) 

e Lemma 16.5.5 is a version of [6, Prop. 4.6, p. 222] 

e Lemma 16.5.7 is a version of [6, Lem. 7.5.5, p. 227]. 

e Lemma 16.5.10 is [6, Prop. 7.3.6, p. 219]. 

e Key Fact 16.6.1’ is adapted from |[6, Thm. 6.4.3(b)2, p. 209]. 

e Theorem 16.7.8 is based on [6, Prop. 6.2.13, pp. 202-203]. 

e Lemma 16.7.9 is taken from [6, Prop. 6.4.2, p. 209]. 

e Corollary 16.7.10 is based on |6, Prop. 6.2.9, p. 200]. 

e Exercise 16.7#6 is [6, Lem. 6.3.2, p. 203). 

Long before the general theorem of Margulis for groups of real rank > 2, 
it was proved by Bass, Milnor, and Serre [2, Thm. 61.2] that the Congruence 
Subgroup Property implies SL(n, Z) is superrigid in SL(n, R). 

“Geometric superrigidity” is the study of differential geometric versions 
of the Margulis Superrigidity Theorem, such as Proposition 16.2.5. (See, for 
example, [7].) 

Details of the derivation of arithmeticity from superrigidity (Section 16.3) 
appear in [6, Chap. 9] and [10, §6.1]. 

Proofs of the Commensurability Criterion (16.3.3) and Commensura- 
tor Superrigidity (16.3.4) can be found in [1], [6, §9.2.11, pp. 305ff, and 
Thm. 7.5.4, pp. 226-227], and [10, §6.2]. 
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Much of the material in Section 16.7 is due to Furstenberg [4]. 

The superrigidity of lattices in the isometry groups of Hf)” and OS)? 
(see Section 16.8) was proved by Corlette [3]. The p-adic version (16.3.2) for 
these groups was proved by Gromov and Schoen [5]. 
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Chapter 17 


Normal Subgroups of I 


This chapter presents a contrast between the lattices in groups of real rank 1 
and those of higher real rank: 
e If rankp G = 1, then I has many, many normal subgroups, so I is very 
far from being simple. 


e If rankgG > 1 (and T is irreducible), then T' is simple modulo finite 
groups. More precisely, if N is any normal subgroup of I, then either 
N is finite, or ['/N is finite. 


§17.1. Normal subgroups in lattices of real rank > 2 


(17.1.1) Theorem (Margulis Normal Subgroups Theorem). Assume 
e rankgpG > 2, 
e Tis an irreducible lattice in G, and 
e N is a normal subgroup of I. 

Then either N is finite, or T/N is finite. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: LAist eomnrene: org/publicdomain/zero/1.0/ 
Main-prerequisi ability (Furstenberg’s Lemma 
(12.6. Chapter 13). Also used: the 
o-algebra of Borel sets modulo sets of measure 0 (Section 14.4) and 
manifolds of negative curvature. 
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(17.1.2) Example. Every lattice in SL(3, R) is simple, modulo finite groups. 
In particular, this is true of SL(3, Z). 


(17.1.3) Remarks. 


1) 


2) 


3) 


4) 


The hypotheses on G and [ are essential: 

(a) If rankgG = 1, then every lattice in G has an infinite normal 
subgroup of infinite index (see Theorem 17.2.1). 

(b) If I is reducible (and G has no compact factors), then [ has an 
infinite normal subgroup of infinite index (see Exercise 2). 


The finite normal subgroups of [ are easy to understand (if [I is ir- 
reducible): the Borel Density Theorem implies that they are the sub- 
groups of the finite abelian group ['M Z(G) (see Corollary 4.5.4). 

If T’ is infinite, then I has infinitely many normal subgroups of finite 
index (see Exercise 5), so [is not simple. 


In most cases, the subgroups of finite index are described by the “Con- 
gruence Subgroup Property’ For example, if [ = SL(3,Z), then the 
principal congruence subgroups are obvious subgroups of finite index 
(see Exercise 4.87#43). More generally, any subgroup of [ that con- 
tains a principal congruence subgroup obviously has finite index. The 
Congruence Subgroup Property is the assertion that every finite-index 
subgroup is one of these obvious ones. It is true for SL(n, Z), whenever 
n > 3, and a similar (but slightly weaker) statement is conjectured to 
be true whenever ranky G > 2 and T is irreducible. 


The remainder of this section presents the main ideas in the proof of 
Theorem 17.1.1. In a nutshell, we will show that if N is an infinite, normal 
subgroup of I, then 


1) 
2) 


T'/N has Kazhdan’s property (7), and 
T'/N is amenable. 


This implies that ['/N is finite (see Corollary 13.1.5). 

In most cases, it is easy to see that [/N has Kazhdan’s property (because 
I has the property), so the main problem is to show that ['/N is amenable. 
This amenability follows easily from an ergodic-theoretic result that we will 
now describe. 


(17.1.4) Assumption. To minimize the amount of Lie theory needed, let us 
assume 


G=SL(,R). 


(17.1.5) Notation. Let 


| 
Ps ‘le: 7% C SL(3, R) = G. 
E * .| 


Hence, P is a (minimal) parabolic subgroup of G. 
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Note that if Q is any closed subgroup of G that contains P, then the 
natural map G/P — G/Q is G-equivariant, so we may say that G/Q is a 
G-equivariant quotient of G/P. Conversely, it is easy to see that spaces of 
the form G/Q are the only G-equivariant quotients of G/P. In fact, these are 
the only quotients even if we only assume that quotient map is equivariant 
almost everywhere (see Exercise 6). 

Furthermore, since [ is a subgroup of G, it is obvious that every G- 
equivariant map is T-equivariant. Conversely, the following surprising result 
shows that every [-equivariant quotient of G/P is G-equivariant (up to a set 
of measure 0): 


(17.1.6) Theorem (Margulis). Suppose 
e rankypG > 2, 
e P is a minimal parabolic subgroup of G, 
e I is irreducible, 
e T acts by homeomorphisms on a compact, metrizable space Z, and 
ew: G/P > Z is essentially T-equivariant (and measurable). 


Then the action of T on Z is measurably isomorphic to the natural action 
of T on G/Q (a.e.), for some closed subgroup Q of G that contains P. 


(17.1.7) Remark. 


1) Perhaps we should clarify the choice of measures in the statement of 
Theorem 17.1.6. (A measure class on G'/P is implicit in the assumption 
that ~ is essentially T-equivariant. Measure classes on Z and G/Q are 
implicit in the “(a.e.)” in the conclusion of the theorem.) 

(a) Because G/P and G/Q are C™ manifolds, Lebesgue measure sup- 
plies a measure class on each of these spaces. The Lebesgue class 
is invariant under all diffeomorphisms, so, in particular, it is G- 
invariant. 

(b) There is a unique measure class on Z for which w is measure-class 
preserving (see Exercise 7). 


2) The proof of Theorem 17.1.6 will be presented in Section 17.3. It may 
be skipped on a first reading. 


Proof of Theorem 17.1.1. Let N be a normal subgroup of I’, and assume 
N is infinite. We wish to show I'/N is finite. Let us assume, for simplicity, 
that [ has Kazhdan’s Property (7'). (For example, this is true if G = SL(3, R), 
or, more generally, if G is simple (see Corollary 13.4.2).) Then ['/N also has 
Kazhdan’s Property (T') (see Proposition 13.1.7), so it suffices to show that 
I'/N is amenable (see Corollary 13.1.5). 
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Suppose ['/N acts by homeomorphisms on a compact, metrizable space X. 
In order to show that ['/N is amenable, it suffices to find an invariant prob- 
ability measure on X (see Theorem 12.3.1(3)). In other words, we wish to 
show that [ has a fixed point in Prob(X). 


e Because P is amenable, there is an (essentially) [-equivariant measur- 
able map =: G/P — Prob(X) (see Corollary 12.6.2). 


e From Theorem 17.1.6, we know there is a closed subgroup Q of G, such 
that the action of [ on Prob(X) is measurably isomorphic (a.e.) to the 
natural action of [ on G/Q. 


Since N acts trivially on X, we know it acts trivially on Prob(X) = G/Q. 
Hence, the kernel of the G-action on G'/Q is infinite (see Exercise 10). How- 
ever, G' is simple (modulo its finite center), so this implies that the action of G 
on G/Q is trivial (see Exercise 11). (It follows that G/Q is a single point, so 
Q = G, but we do not need quite such a strong conclusion.) Since [ C G, 
then the action of [ on G/Q is trivial. In other words, every point in G/Q 
is fixed by T. Since G/Q & Prob(X) (a.e.), we conclude that almost every 
point in Prob(X) is fixed by T; therefore, [ has a fixed point in Prob(X), as 
desired. O 


(17.1.8) Remark. The proof of Theorem 17.1.1 concludes that “almost every 
point in Prob(X) is fixed by [7 so it may seem that the proof provides not 
just a single T-invariant measure, but many of them. This is not the case: 
The proof implies that w is essentially constant (see Exercise 12). This means 
that the T-invariant measure class [7),j] is supported on a single point of 
Prob(X), so “a.e”? means only one point. 


Exercises for 817.1. 


#1. Assume 

e G is not isogenous to SO(1,7) or SU(1, 7), for any n, 

e IT is irreducible, and 

e G has no compact factors. 
In many cases, Kazhdan’s property (7) implies that the abelianization 
T/|,T] of I is finite (see Corollary 13.4.3(2)). Use Theorem 17.1.1 to 
prove this in the remaining cases. (We saw a different proof of this in 
Exercise 16.1#3.) 


#2. Verify Remark 17.1.3(1b). 
[Hint: Proposition 4.3.3.] 


#3. Suppose I is a lattice in SL(3,R). Show that [ has no nontrivial, finite, 
normal subgroups. 


#4. Suppose T is an irreducible lattice in G. Show that I’ has only finitely 
many finite, normal subgroups. 


£9. 


#6. 


#7. 


#8. 


#9. 


4£10. 


#11. 
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Show that if T is infinite, then it has infinitely many normal subgroups 
of finite index. 
[Hint: Exercise 4.8#9.] 


Suppose 
e H isaclosed subgroup of G, 
e G acts continuously on a metrizable space Z, and 
ew: G/H — Z is essentially G-equivariant (and measurable). 
Show the action of G on Z is measurably isomorphic to the action of G 
on G/Q (a.e.), for some closed subgroup Q of G that contains H. More 
precisely, show there is a measurable ¢: Z > G/Q, such that: 
a) dis measure-class preserving (i.e., a subset A of G/Q has measure 0 
if and only if its inverse image @~1(A) has measure 0), 
b) ¢ is one-to-one (a.e.) (i.e., d is one-to-one on a conull subset of Z), 
and 
c) ¢ is essentially G-equivariant. 


[Hint: See Remark 17.1.7(1) for an explanation of the measure classes to be 
used on G/H, G/Q, and Z. For each g € G, the set {x € G/H | V(gx) = 
g(a) } is conull. By Fubini’s Theorem, there is some zo € G/H, such that 
W(gx0) = g- (xo) for a.e. g. Show the G-orbit of w(xo) is conull in Z, and 


let Q = Stabe (w(xo)).] 


Suppose 
e Ww: Y — Z is measurable, and 
e {1; and fg are measures on Y that are in the same measure class. 
Show: 
a) The measures ~,(ui1) and ~.(~2) on Z are in the same measure 
class. 
b) For any measure class on Y, there is a unique measure class on Z 
for which ~ is measure-class preserving. 


In the setting of Theorem 17.1.6, show that w is essentially onto. That 
is, the image ~(G/P) is a conull subset of Z. 

[Hint: By choice of the measure class on Z, we know that 7 is measure-class 
preserving. | 

Let G = SL(3, R) and T = SL(3, Z). Show that the natural action of T 
on R?/Z? = T? is a T-equivariant quotient of the action on R®, but is 
not a G-equivariant quotient. 


In the proof of Theorem 17.1.1, we know that Prob(X) = G/Q (a.e.), 
so each element of N fixes a.e. point in G/Q. Show that N acts trivially 
on G/Q (everywhere, not only a.e.). 


[Hint: The action of N is continuous.] 


In the notation of the proof of Theorem 17.1.1, show that the action 
of G on G/Q is trivial. 
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[Hint: Show that the kernel of the action of G on G/Q is closed. You may 
assume, without proof, that G is an almost simple Lie group. This means 
that every proper, closed, normal subgroup of G is finite.] 

#12. In the setting of the proof of Theorem 17.1.1, show that w is constant 
(a.e.). 
[Hint: The proof shows that a.e. point in the image of w is fixed by G. 
Because w is G-equivariant, and G is transitive on G/P, this implies that w 
is constant (a.e.).] 


§17.2. Normal subgroups in lattices of rank one 


Theorem 17.1.1 assumes rankp G > 2. The following result shows that this 
condition is necessary: 


(17.2.1) Theorem. /f rankpG = 1, then T has a normal subgroup N, such 
that neither N nor T/N is finite. 


Proof (assumes familiarity with manifolds of negative curvature). For sim- 
plicity, assume: 

e I is torsion free, so it is the fundamental group of the locally symmetric 

space M =I\G/K (where K is a maximal compact subgroup of G). 

e M is compact. 

e The locally symmetric metric on M has been normalized to have sec- 
tional curvature < —1. 

The injectivity radius of M is > 2. 

There are closed geodesics y and \ in M, such that length(A) > 27 and 
dist(y, A) > 2. 

The geodesics y and X represent (conjugacy classes of) nontrivial elements 
7 and d of the fundamental group [ of M. Let N be the smallest normal 
subgroup of I that contains A: 

It suffices to show that ¥” is nontrivial in '/N, for every n € Z* (see Ex- 
ercise 1). Construct a CW complex M by gluing the boundary of a 2-disk D) 
to M along the curve \, so the fundamental group of M is T/N. 

We wish to show that 7” is not null-homotopic in M. Suppose there is a 
continuous map f: D? + M, such that the restriction of f to the boundary 
of D* is y™. Let 


Do = f-*(M), 
so D2 is a surface of genus 0 with some number k of boundary curves. We 
may assume f is minimal (ie., the area of D? under the pull-back metric 
is minimal). Then D2 is a surface of curvature K(x) < —1 whose boundary 
curves are geodesics. Note that f maps 


e one boundary geodesic onto y”, and 


e the other k — 1 boundary geodesics onto multiples of A. 
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This yields a contradiction: 


Qn(k — 2) = —20 X(D2) (see Exercise 2) 
=- i tae (Gauss-Bonnet Theorem) 
Ds 
> | ldx 
Ds 
> (k — 1) length()) (see Exercise 3) 
> Qn(k — 1). 


(17.2.2) Remark. Perhaps the simplest example of Theorem 17.2.1 is when 
G = SL(2,R) and I is a free group (see Remark 6.1.6). In this case, it is easy 
to find a normal subgroup N, such that N and ['/N are both infinite. (For 
example, we could take N = [P,T].) 


There are numerous strengthenings of Theorem 17.2.1 that provide infi- 
nite quotients of [ with various interesting properties (if rankgG = 1). We 
will conclude this section by briefly describing just one such example. 

A classical theorem of Higman, Neumann, and Neumann states that every 
countable group can be embedded in a 2-generated group. Since 2-generated 
groups are precisely the quotients of the free group F2 on 2 generators, this 
means that F2 is “SQ-universal” in the following sense: 


(17.2.3) Definition. [ is $Q-universal if every countable group is isomorphic 
to a subgroup of a quotient of T. (The letters “SQ” stand for “subgroup- 
quotient?” ) 

More precisely, the SQ-universality of [ means that if A is any countable 
group, then there exists a normal subgroup N of I’, such that A is isomorphic 
to a subgroup of ['/N. 


(17.2.4) Example. F,, is SQ-universal, for any n > 2 (see Exercise 4). 


SQ-universality holds not only for free groups, which are lattices in 
SL(2, IR) (see Remark 6.1.6), but for any other lattice of real rank one: 


(17.2.5) Theorem. /f rankg G =1, then T is SQ-universal. 


(17.2.6) Remark. Although the results in this section have been stated only 
for [’, which is a lattice, the theorems are valid for a much more general class 
of groups. This is because normal subgroups can be obtained from an assump- 
tion of negative curvature (as is illustrated by the proof of Theorem 17.2.1). 
Indeed, Theorems 17.2.1 and 17.2.5 remain valid when I is replaced with any 
group that is Gromov hyperbolic (see Definition 10.2.1), or even “relatively” 
hyperbolic (and not commensurable to a cyclic group). 
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Exercises for §17.2. 


#1. Suppose 
e y and X are nontrivial elements of I, 
e I is torsion free, 
e N is anormal subgroup of IT, 
e XEN, and 
e 7” ¢ N, for every positive integer n. 
Show that neither N nor I'/N is infinite. 


#2. Show that the Euler characteristic of a 2-disk with k — 1 punctures is 
2—k. 


#3. In the notation of the proof of Theorem 17.2.1, show 
| ldx > (k — 1) length()). 
D2 


0) 


(Hint: All but one of the boundary components are at least as long as 4, 
and a boundary collar of width 1 is disjoint from the collar around any other 
boundary component.| 


#4. Justify Example 17.2.4. 


[Hint: You may assume the theorem of Higman, Neumann, and Neumann on 
embedding countable groups in 2-generated groups.| 


§17.3. [-equivariant quotients of G/P (optional) 


In this section, we explain how to prove Theorem 17.1.6. However, we will 
assume G = SL(2,R) x SL(2, R), for simplicity. 

The space Z is not known explicitly, so it is difficult to study directly. 
Instead, as in the proof of the ergodic decomposition in Section 14.4, we will 
look at the o-algebra 6(Z) of Borel sets, modulo the sets of measure 0. (We 
will think of this as the set of {0, 1}-valued functions in £°°(Z), by identifying 
each set with its characteristic function.) Note that w induces a I-equivariant 
inclusion 


w*: B(Z) > B(G/P) 
(see Exercise 1). Via the inclusion ~*, we can identify B(Z) with a sub-o- 
algebra of B(G/P): 
B(Z) C B(G/P). 
In order to establish that Z is a G-equivariant quotient of G/P, we wish to 
show that 6(Z) is G-invariant (see Exercise 2). Therefore, Theorem 17.1.6 
can be reformulated as follows: 


(17.1.6’) Theorem. Jf B is any T-invariant sub-c-algebra of B(G/P), then 
B is G-invariant. 


To make things easier, let us settle for a lesser goal temporarily: 
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17.3.1) Definition. The trivial Boolean sub-o-algebra of B(G/P) is {0,1} 
the set of constant functions). 


—_~— 


(17.3.2) Proposition. Jf B is any nontrivial, T-invariant sub-c-algebra of 
B(G/P), then B contains a nontrivial G-invariant Boolean algebra. 


(17.3.3) Remark. 
1) To establish Proposition 17.3.2, we will find a characteristic function 
f € B(G/P) \ {0,1}, such that Gf C B. 
2) The proof of Theorem 17.1.6’ is similar: let Bg be the (unique) maximal 


G-invariant Boolean subalgebra of B. If Bg # B, we will find some 
f € B(G/P) \ Ba, such that Gf C B. (This is a contradiction.) 


(17.3.4) Assumption. To simplify the algebra in the proof of Proposi- 
tion 17.3.2, let us assume G = SL(2, R) x SL(2,R). 


(17.3.5) Notation. 
e G=G; xX Go, where G; = G2 = SL(2,R), 
e P= P, x Py», where P; = * .| EGE 


e U=U, x U2, where U; = E Hl (ae af 


e V=V, x Vo, where V; = Bi C Gi, 
e I = some irreducible lattice in G, and 
e B = some [-invariant sub-c-algebra of B(G/P) . 


(17.3.6) Remark. We have G/P = (G/P,) x (G2/P2). Here are two useful, 
concrete descriptions of this space: 


e G/P =RP! x RP! & R? (a.e.), and 

e G/P=V, x V2 (a.e.) (see Exercise 4). 
Note that, if we identify G/P with R? (a.e.), then, for the action of G; 
on G/P, we have 


0 [Fa] Gu) = Woy), and 


o[) i]en= eran 


(see Exercise 3). 


The proof of Proposition 17.3.2 employs two preliminary results. The 
first is based on a standard fact from first-year analysis: 


(17.3.7) Lemma (Lebesgue Differentiation Theorem). Let 
° fe LR"), 
e \ be the Lebesgue measure on R”, and 
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e B,(p) be the ball of radius r centered at p. 


For a.e. p € R”, we have 


lim aa! fdd=fi(p). (17.3.8) 
B,(p) 


70 4(B,(p)) 
Letting n = 1 and applying Fubini’s Theorem yields: 
(17.3.9) Corollary. Let 
e fe £(R’), 


ea= |" al EG, for some k > 1, and 


e 72: R* > {0} x R be the projection onto the y-azis. 
Then, for a.e. v € Vj, 


a” uf converges in measure to (uf) om: as n — oo. 


Proof. Exercise 6. LO 


The other result to be used in the proof of Proposition 17.3.2 is a conse- 
quence of the Moore Ergodicity Theorem: 


(17.3.10) Proposition. For a.e.v €V,, Tu~ta-N is dense in G. 


Proof. Taking inverses, we wish to show aNvIT = G; i.e., the (forward) a-orbit 
of vI is dense in G/T, for a.e. v € Vi. We will show that 
aNogl = G, for ae. g € G, 

and leave the remainder of the proof to the reader (see Exercise 7). 

Given a nonempty open subset O of G/T, let 

E= U a_"O. 
n>0 

Clearly, a~'E C E. Since p(a~'E) = (FE) (because the measure on G/T is 
G-invariant), this implies FE is a-invariant (a.e.). Since the Moore Ergodicity 
Theorem (14.2.4) tells us that a is ergodic on G/T, we conclude that EF = G/T 
(a.e.). This means that, for a.e. g € G, the forward a-orbit of g intersects O. 

Since © is an arbitrary open subset, and G/T is second countable, we 
conclude that the forward a-orbit of a.e. g is dense. O 


Proof of Proposition 17.3.2 for G = SL(2,R) x SL(2,R). Identify 
G/P with R?, as in Remark 17.3.6. Since B is nontrivial, it contains some 
nonconstant f. Now f cannot be essentially constant both on almost every 
vertical line and on almost every horizontal line (see Exercise 8), so we may 
assume there is a non-null set of vertical lines on which it is not constant. 
This means that 


{rev 


(uf) 0 7 is not 
essentially constant 


} has positive measure. 
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Corollary 17.3.9 and Proposition 17.3.10 tell us we may choose v in this set, 
with the additional properties that 


e auf > (vf) 072, and 
e Tv—ta- is dense in G. 
Let f = (vf) 072, so 


auf > f. 
Now, for any g € G, there exist y; € [ and n; — ov, such that 
Gir= iv a”  g. 


Then we have 
Ga vH=9, ef, 
so the [-invariance of 6 implies 
Banf=ga™vf > 9F 
(see Exercise 12). Since G is closed (see Exercise 11), we conclude that g f € B. 


Since g is an arbitrary element of G, this means Gf C B. Also, from the choice 
of v, we know that f = (vf) o 72 is not essentially constant. 


Combining the above argument with a list of the G-invariant Boolean 
subalgebras of B(G/P) yields Theorem 17.1.6’: 


Proof of Theorem 17.1.6’ for G = SL(2,R) x SL(2,R). Let Be be 
the largest G-invariant subalgebra of B, and suppose B 4 Bg. (This will lead 
to a contradiction.) 

It is shown in Exercise 10 that the only G-invariant subalgebras of 
B(G/P) = B(R?) are 

e B(R’), 

e {functions constant on horizontal lines (a.e.) }, 

e {functions constant on vertical lines (a.e.) }, and 

e {0,1}. 
So Be must be one of these 4 subalgebras. 

We know Bg 4 B(R?) (otherwise B = Bg). Also, we know B is nontrivial 
(otherwise 6 = {0,1} = Bg), so Proposition 17.3.2 tells us that Bg 4 {0,1}. 
Hence, we may assume, by symmetry, that 


Ba = { functions constant on vertical lines (a.e.) }. (Ei) 


Since B # Bg, there is some f € B, such that f is not essentially constant on 
vertical lines. Applying the proof of Proposition 17.3.2 yields f, such that 


e Gf CB, so f € Ba, and 
e f is not essentially constant on vertical lines. 
This contradicts (17.3.11). 
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Very similar ideas yield the general case of Theorem 17.1.6, if one is 
familiar with real roots and parabolic subgroups. To illustrate this, without 
using extensive Lie-theoretic language, let us explicitly describe the setup for 
G = SL(8,R). 


Modifications for SL(3, R). 


* 1x* x 1 * 1 
p=|ts F v=| i#] n= | 1 | ve=| 1+] 

* kK 1 1 1 
Note that V = (Vj, V2). 


e There are exactly four subgroups containing P, namely, 


Pe Gn, Pee ie J=™.?), B= | ay. Pi. 


*k OK OK *k OK 
Hence, there are precisely four G-invariant subalgebras of B(G/P). 
Namely, if we identify B(G/P) with B(V), then the G-invariant subal- 
gebras of B(V) are 
{0, 1}, 
right Vj-invariant functions, 
right V2-invariant functions. 


Oo OO O 


(17.3.12) Remark. The homogeneous spaces G/P, and G/P: are RP? 
and the Grassmannian G23 of 2-planes in R® (see Exercise 13). Hence, 
in geometric terms, the G-invariant Boolean subalgebras of B(G'/P) are 
B(G/P), {0,1}, B(RP?), and B(G,3). 


e Let m2 be the projection onto V2 in the natural semidirect product 


1 x 
V =V.% V3, where Vib = 1%) 


€ G, Exercise 14 tells us 


? : ae |" 1 i, P. (17.3.13) 


ee ee 


e A generalization of the Lebesgue Differentiation Theorem tells us, for 
f € B(G/P) = B(V) and ae. v € V5", that 
a” uf converges in measure to (uf) o 7. 


With these facts in hand, it is not difficult to prove Theorem 17.1.6’ under 
the assumption that G = SL(3, R) (see Exercise 15). 


k 
e Fora= k; 
Lyk? 
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Exercises for §17.3. 


#1. 


#2. 


#3. 


4A, 


#5. 


H6. 


#T. 


HH. 


In the setting of Theorem 17.1.6, define y*: B(Z) — B(G/P) by 
w*(f) = fow. Show that ~* is injective and T-equivariant. 


[Hint: Injectivity relies on the fact that ~ is measure-class preserving. | 


In the setting of Theorem 17.1.6, show that if the sub-c-algebra y* (B(Z)) 
of B(G/P) is G-invariant, then Z is a G-equivariant quotient of G/P 
(a.e.). 

[Hint: To reduce problems of measurability, you may pretend that G is count- 
able. More precisely, use Exercise 14.4745 to show that if H is any countable 
subgroup of G that contains I’, then the I’-action can be extended to an action 
of H on Z by Borel maps, such that, for each h € H, we have (hx) = hw(a) 
for a.e. c € G/P.] 


Let G; and P; be as in Notation 17.3.5. Show that choosing appro- 
priate coordinates on RP! = RU {oc} identifies the action of G; on 
G;/P; with the action of G; = SL(2,R) on RU {co} by linear-fractional 


transformations: 
a b (re ax +6 
c d cx td 


In particular, 

k; i 7 

el (x) = kx and ‘ (x) =x2+t. 
[Hint: Map a nonzero vector (1,22) € R? to its reciprocal slope x1/x2 € 
RU {oo}.] 


Let G;, Pj, and V; be as in Notation 17.3.5. Show that the map V; > 
G;/P;: v + vP; injective and measure-class preserving. 
[Hint: Exercise 3.] 


Show that Equation (17.3.8) is equivalent to 
il My 
lim a fF (p+ Z) aXe) = fl). 
k->oo \(Bi(0)) B,(0) k 
[Hint: A change of variables maps B;(0) onto B,(p) with r = 1/k.] 


Prove Corollary 17.3.9. 
[Hint: Exercise 5.] 


Complete the proof of Proposition 17.3.10: assume, for a.e. g € G, that 
aNgI is dense in G, and show, for a.e. v € V;, that aNvT is dense in G. 
[Hint: If a\gI is dense, then the same is true when g is replaced by any 
element of Cg(a) Ui g.] 


Let f € B(R?). Show that if f is essentially constant on a.e. vertical 
line and on a.e. horizontal line, then f is constant (a.e.). 


360 17. NORMAL SUBGROUPS OF [ 


#9. Assume Notation 17.3.5. Show that the only subgroups of G contain- 
ing P are P, G; x Po, P, x Go, and G. 
(Hint: P is the stabilizer of a point in RP! x RP', and has only 4 orbits. } 


#10. Assume Notation 17.3.5. Show that the only G-equivariant quotients 
of G/P are G/P, G2/P2, Gi /Pi, and G/G. 
[Hint: Exercise 9.] 


#11. Suppose BG is a sub-o-algebra of B(G/P). Show that B is closed under 
convergence in measure. 
More precisely, fix a probability measure p in the Lebesgue measure 
class on G/P, and show that B is a closed in the topology corresponding 
to the metric on B(G/P) that is defined by d(Aj, Az) = w(Ai A A). 


#12. Show that the action of G on B(G/P) is continuous. 


[Hint: Suppose gn, — e and (A, A A) > 0. The Radon-Nikodym derivative 
d(gn)«4/du tends uniformly to 1, so p4(gn An AgnA) > 0. To bound p(gnAA 
A), note that tact ydu— J, edu, for every yp € C.(G/P).| 


#13. In the notation of Remark 17.3.12, show that G/P, and G/P» are G- 
equivariantly diffeomorphic to RP? and G23, respectively. 


[Hint: Verify that the stabilizer of a point in RP? is P,, and the stabilizer of 
a point in G2,3 is P..] 


#14. Verify Equation (17.3.13). 
[Hint: Since a € P, we have agP = (aga')P, for any g € G.] 


#15. Prove Theorem 17.1.6’ under the assumption that G = SL(3, R). 


[Hint: You may assume (without proof) the facts stated in the “Modifications 
for SL(3, R)”] 


Notes 


The Normal Subgroups Theorem (17.1.1) is due to G. A. Margulis [5, 6, 7]. 
Expositions of the proof appear in [8, Chap. 4] and [12, Chap. 8]. (However, 
the proof in [12] assumes that G has Kazhdan’s property (T).) 

When I is not cocompact, the Normal Subgroups Theorem can be proved 
by algebraic methods derived from the proof of the Congruence Subgroup 
Problem (see [9, Thms. A and B, p. 109] and [10, Cor. 1, p. 75]). On the 
other hand, it seems that the ergodic-theoretic approach of Margulis provides 
the only known proof in the cocompact case. 

Regarding Remark 17.1.3(4), see [11] for an introduction to the Congru- 
ence Subgroup Property. 

Theorem 17.1.6 is stated for general G of real rank > 2 in [8, Cor. 2.13] and 
(12, Thm. 8.1.4]. Theorem 17.1.6’ is in [8, Thm. 4.2.11] and [12, Thm. 8.1.3]. 
See [12, §8.2 and §8.3] and [8, §4.2] for expositions of the proof. 

The proof of Theorem 17.2.1 is adapted from [8, 5.5.F, pp. 150-152]. 
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Chapter 18 


Arithmetic Subgroups 
of Classical Groups 


This chapter will give a quite explicit description (up to commensurability) of 
all the arithmetic subgroups of almost every classical Lie group G (see The- 
orem 18.5.3). (Recall that a simple Lie group G is “classical” if it is either 
a special linear group, an orthogonal group, a unitary group, or a symplectic 
group (see Definition A2.1).) The key point is that all the Q-forms of G are 
also classical, not exceptional, so they are fairly easy to understand. How- 
ever, there is an exception to this rule: some 8-dimensional orthogonal groups 
have Q-forms of so-called “triality type” that are not classical and will not be 
discussed in any detail here (see Remark 18.5.10). 

Given G, which is a Lie group over R, we would like to know all of its 
Qforms (because, by definition, arithmetic groups are made from Q-forms). 
However, we will start with the somewhat simpler problem that replaces the 
fields Q and R with the fields R and C: finding the R-forms of the classical 
Lie groups over C. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, T is a lattice in the semisimple Lie group G C SL(¢, R). 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: Restriction of Scalars (Section 5.5) and 


examples of arithmetic subgroups (Chapter 6). 
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§18.1. R-forms of classical simple groups over C 


To set the stage, let us recall the classical result that almost all complex 
simple groups are classical: 


(18.1.1) Theorem (Cartan, Killing). All but finitely many of the simple Lie 
groups over C are isogenous to either SL(n,C), SO(n, C), or Sp(2n, C), for 
some n. 


(18.1.2) Remark. Up to isogeny, there are exactly five simple Lie groups over C 
that are not classical. They are the “exceptional” simple groups, and are 
called Ee, Ez, Es, F4, and Go. 


Now, we would like to describe the R-forms of each of the classical groups. 
For example, finding all the R-forms of SL(n, C) would mean making a list of 
the (simple) Lie groups G, such that the “complexification” of G is SL(n, C). 
This is not difficult, but we should perhaps begin by explaining more clearly 
what it means. 

It has already been mentioned that, intuitively, the complexification of G 
is the complex Lie group that is obtained from G by replacing real num- 
bers with complex numbers. For example, the complexification of SL(n, R) is 
SL(n, C). In general, G is (isogenous to) the set of real solutions of a certain 
set of equations, and we let Gc be the set of complex solutions of the same 
set of equations: 


(18.1.3) Notation. Assume G C SL(¢,R), for some ¢. Since G is al- 
most Zariski closed (see Theorem A4.9), there is a certain subset Q of 
R[r11,.--, ee], such that G° = Var(Q)°. Let 


Gc = Varc(Q) = {g € SL(Z,C) | Q(g) = 0, for all QE Q}. 
Then Gc is a (complex, semisimple) Lie group. 
(18.1.4) Example. 
1) SL(n, R)c = SL(n, C). 
2) SO(n)c = SO(n, C). 
3) SO(m,n)c = SO(m +n, C) (see Exercise 1). 


(18.1.5) Definition. If Gc is isomorphic to H, then we say that 
e H is the complexification of G, and that 
e G is an R-form of H. 


The following result lists the complexification of each classical group. It 
is not difficult to memorize the correspondence. For example, it is obvious 
from the notation that the complexification of Sp(m, n) should be symplectic. 
Indeed, the only case that really requires memorization is the complexification 
of SU(m, n) (see Proposition 18.1.6(Aiv)). 
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(18.1.6) Proposition. Here is the complexification of each classical Lie group. 


A) Real forms of special linear groups: 
(i) SL(n, R)c = SL(n, C), 
(ii) SL(n, C)c = SL(n, C) x SL(n, C), 
(iii) SL(n, H)c & SL(2n, C), 
(iv) SU(m, n)c = SL(m + n, C). 
B) Real forms of orthogonal groups: 
(i) SO(m, n)c = SO(m +n, C), 
(ii) SO(n, C)c & SO(n, C) x SO(n, C), 
(iii) SO(n, H)c & SO(2n, C). 
C) Real forms of symplectic groups: 
(i) Sp(n, R)c = Sp(n, C), 
(ii) Sp(n, C)c = Sp(n, C) x Sp(n, C), 
(iii) Sp(m, n)c = Sp(2(m + n),C). 


Some parts of this proposition are more-or-less obvious (such as SL(n, R)c = 
SL(n, C)). A few other examples appear in Section 18.2 below, and the meth- 
ods used there can be applied to all of the cases. In fact, all of the calculations 
are straightforward adaptations of the examples, except perhaps the determi- 
nation of SO(n, H)c (see Exercise 18.2#4). 

Nothing in Proposition 18.1.6 is very surprising. What is not at all obvi- 
ous is that this list of real forms is complete: 


(18.1.7) Theorem (E. Cartan). Every real form of SL(n,C), SO(n,C), or 
Sp(n,C) appears in Proposition 18.1.6 (up to isogeny). 


We will discuss a proof of this theorem in Section 18.3. 


(18.1.8) Remarks. 


1) From Proposition 18.1.6, we see that a single complex group may have 
several different real forms. However, there are always only finitely 
many (even for exceptional groups). 


2) The Lie algebra of Gc is the tensor product g®C (see Exercise 2). This 
is independent of the embedding of G in SL(£, C), so, up to isogeny, Gc 
is independent of the embedding of G in SL(¢, C). 


3) We ignored a technical issue in Notation 18.1.3: there may be many 
different choices of Q (having the same set of real solutions), and it 
may be the case that different choices yield different sets of complex 
solutions. (In fact, a bad choice of Q can yield a set of complex solutions 
that is not a group.) To eliminate this problem, we should insist that 
O be maximal; that is, 


QO={QeER[z11,...,2¢2] | Q(g) =0, for all ge G}. 
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Then Gc is the Zariski closure of G (over the field C), from which it 
follows that Gc, like G, is a semisimple Lie group. 


(18.1.9) Example. Because the center of SL(3,R) is trivial, we see that 

SL(3, R) is the same Lie group as PSL(3, R). On the other hand, we have 
SL(3, R)c = SL(3, C) 4 PSL(3, C) = PSL(3, R)c. 

This is a concrete illustration of the fact that different embeddings of G can 

yield different complexifications. Note, however, that SL(3,C) is isogenous 


to PSL(3, C), so the difference between the complexifications is negligible (cf. 
Remark 18.1.8(2)). 


Exercises for §18.1. 


#1. Show that SO(m,n)c = SO(m+n,C). 
[Hint: SO(m, n)c is conjugate to SO(m+n, C) in SL(m+n, C), because —1 is 
a square in C.| 


#2. Show that the Lie algebra of Gc is g ®C. 


§18.2. Calculating the complexification of G 


This section justifies Proposition 18.1.6, by calculating the complexification 
of each classical group. 

Let us start with SL(n,C). This is already a complex Lie group, but we 
can think of it as a real Lie group of twice the dimension. As such, it has a 
complexification: 


(18.2.1) Proposition. SL(n, C)c = SL(n, C) x SL(n, C). 


Proof. We should embed SL(n, C) as a subgroup of SL(2n, R), find the corre- 
sponding set QO of defining polynomials, and determine the complex solutions. 
However, it is more convenient to sidestep some of these calculations by using 
restriction of scalars, the method described in §5.5. 

Define A: C3 C@C by A(z) = (z,Z). Then the vectors A(1) = (1,1) 
and A(i) = (i,—7) are linearly independent (over C), so they form a basis 
of C@C. Thus, A(C) is the R-span of a basis, so it is a R-form of C @ C. 
Therefore, letting V = C2”, we see that 

Ve = A(C”) = { (v,0) |veCc"} 
is a real form of V. Let 

(SL(n, C) x SL(n,C)).» = {g € SL(n,C) x SL(n, C) | g(Ve) = Vr}. 
Then we have an isomorphism 

A: SL(n,C) —> (SL(n, C) x SL(n,C))p, 
defined by A(g) = (9,9), so 
SL(n, C)c & ([SL(n, C) x SL(n, C)lr)¢ = SL(n, C) x SL(n, C). O 
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(18.2.2) Remarks. 


1) Generalizing Proposition 18.2.1, one can show that if G is isogenous to 
a complex Lie group, then Gc is isogenous to G x G. 


2) From Proposition 18.2.1, we see that Gc need not be simple, even if G' is 


simple. However, this only happens when G is complex: if G is simple, 
and G is not isogenous to a complex Lie group, then Gc is simple. 


Although not stated explicitly there, the proof of Proposition 18.2.1 is 
based on the fact that C@rp C=COC. Namely, the map 


C@rC—>C@C defined by v @AH A(v) dr 


is an isomorphism of C-algebras. Analogously, understanding the complexifi- 
cation of a group defined from the algebra H of quaternions will be based on 
a calculation of H ®p C. 


(18.2.3) Lemma. The tensor product H ®p C is isomorphic to Mat2x2(C). 


Proof. Define an R-linear map ¢: H > Mate,.2(C) by 

; a O ; 0 1 Oos:4 
It is straightforward to verify that @ is an injective ring homomorphism. 
Furthermore, $({1,i,3,k}) is a C-basis of Matz,2(C). Therefore, the map 


ob: H@ C > Matgy2(C) defined by ¢(v ® A) = o(v) A is a ring isomorphism 
(see Exercise 1). 


(18.2.4) Proposition. SL(n, H)c = SL(2n, C). 


Proof. From Lemma 18.2.3, we have 
SL(n, H)c & SL(n, Mat22(C)) = SL(2n, C) 


(see Exercises 2 and 3). 


As additional examples, let us look at the complexifications of the classical 
simple Lie groups that are compact, namely, SO(n), SU(m), and Sp(n). As 
observed in Example 18.1.4(2), we have SO(n)c = SO(n, C). The other cases 
are not as obvious. 

(18.2.5) Proposition. SU(n)c = SL(n,C). 
Proof. Let 

eo:CC, 

eo:C" >C", and 

e a: SL(n, C) > SL(n, C) 
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be the usual complex conjugations o(z) = Z, o (v) = 3, and G(g) = gy. We 
have 
SU(n) = {g € SL(n,C) | g*g = Id} 
={g € SL(n,C) | a(g")g = Id}, 
so, in order to calculate SU(n)c, we should determine the map 7) on SL(n, C) x 
SL(n, C) that corresponds to o when we identify C” with (C” © C”)p under 


the map A. 
First, let us determine 7. That is, we wish to identify C” with R2”, and 
extend @ toa C-linear map on C2”. However, as usual, we use the R-form 


Ac”), in place of R?”. It is obvious that if we 
define 7: C? @C” + C” OC” by F(z, y) = (y, 2), 


then 7 is C-linear, and the following diagram commutes: 


co 4 cea 
le | 
ce AL cect. 
Thus, it is fairly clear that 7(g,h) = (h,g). Hence 
SU(n)c = {(g,h) € SL(n, C) x SL(n, C) | H(g7,h*)(g,h) = (Id, 1d) } 

= {(g,h) € SL(n,C) x SL(n, C) | (h", 97) (9, h) = (Id, Id) } 

={(9,(97)"") |g € SL(n,C) } 

= SL(n, C). O 


(18.2.6) Proposition. Sp(n)c = Sp(2n, C). 
Proof. Let 


e ¢: HS Mate x2(C) be the embedding that is described in the proof of 
Lemma, 18.2.3, 


e 7 be the usual conjugation on H, 


QO: ah 
e J= CS 5] and 


e 7: Matex2(C) > Matzx2(C) be defined by n(x) = J~ta? J. 
Then 7 is C-linear, and the following diagram commutes: 
al Bae Mat2x2(C) 
i | 


H me Mat2x2(C). 
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Hence, because 
Sp(2) = {g € SL(2,H) | g*g =Id} 


-{fe Jee] 8 sa] e =) 
we see that 


sp@e={[% jl] SL (2,Mataxatey) | (5) ™G| (2 a] ral 


a b Nae ee Ge) “D 
={ff esuaaanaey [sf f]sfe 4] =n] 
= {g €SL(4,C) | J7'g7 Jg = Id} 
= {g€SL(4,C) | g’ Jg=J} 
= Sp(4, C). 
Similarly, letting 


In = € SL(2n,C), 


J 
the same calculations show that 
Sp(n)c = { g € SL(2n, C) | 97 dng = Jn } & Sp(2n,C). 


Exercises for §18.2. 


#1. In the proof of Lemma 18.2.3, verify: 
a) @ is an injective ring homomorphism, 
b) o({1,2,3,k}) is a C-basis of Matz,2(C), and 
c) b is an isomorphism of C-algebras. 

##2. Show SL(n, H)c & SL(n, Mate,2(C)). 


[Hint: Define ¢ as in the proof of Lemma 18.2.3. Use the proof of Proposi- 
tion 18.2.1, with ¢ in the place of A.] 


#3. Show SL(n, Mataxa(C)) ~ SL(dn, C). 


#4. Show that SO(n, H)c & SO(2n, C). 


[Hint: Similar to (18.2.6). To calculate 7, @ C, note that 7,(a2) = j~* r(a) j, 
for x € H.] 


§18.3. How to find the real forms of complex groups 


In this section, we will explain how to find all of the possible R-forms of 
SL(n,C). (Similar techniques can be used to justify the other cases of Theo- 
rem 18.1.7, but additional calculations are needed, and we omit the details.) 
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We take an algebraic approach, based on Galois theory, and we first review the 
most basic terminology from the theory of (nonabelian) group cohomology. 


§18.3(i). Definition of the first cohomology of a group. 


(18.3.1) Definitions. Suppose a group X acts (on the left) by automorphisms 
on a group M. (For x € X and m € M, we write *m for the image of m 
under 2.) 


1) A function a: X > M is a 1-cocycle (or “crossed homomorphism” ) if 
a(xy) = a(x) -7a(y) for all z,y € X. 
2) Two 1-cocycles a and { are equivalent (or “cohomologous” ) if there is 
some m € M, such that 


a(x) =m": B(x)-*m for alla € X. 
3) H'(X;M) is the set of equivalence classes of all 1-cocycles. It is called 
the first cohomology of X with coefficients in M. 


4) A 1-cocycle is a coboundary if it is cohomologous to the trivial 1- 
cocycle defined by r(x) = e for all x € X. 


(18.3.2) Warning. In our applications, the coefficient group M is sometimes 
nonabelian. In this case, H'(X;M) is a set with no obvious algebraic struc- 
ture. However, if M is an abelian group (as is often assumed in textbooks on 
group cohomology), then H'(X; M) is an abelian group. 


§18.3(ii). How Galois cohomology comes into the picture. For 
convenience, let Gc = SL(n, C). Suppose p: Gc — SL(N, C) is an embedding, 
such that p(Gc) is defined over R. We wish to find all the possibilities for 
the group p(Gc)r = p(Gc) A SL(N, R) that can be obtained by considering 
all the possible choices of p. 

Let ao denote complex conjugation, the nontrivial Galois automorphism 
of C over R. Since R= {z € C| o(z) =z}, we have 


SL(N,R) = {g € SL(N,C) | o(g) =g}, 
where we apply o to a matrix by applying it to each of the matrix entries. 
Therefore 


p(Gc)r = p(Gc) NSL(N,R) = {9 € p(Gc) | o(9) =g9}- 
Since p(Gc) is defined over R, we know that it is invariant under 0, so we 
have ; 
Gc > p(Gc) > p(Gc) Ge. 
Let ¢ = p top: Gc — Ge be the composition. Then the real form corre- 
sponding to p is 


Gr = p-*(p(Gc) NSL(N,R)) = {9 € Ge | (9) = g}. 
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To summarize, the obvious R-form of Gc is the set of fixed points of the usual 
complex conjugation, and any other R-form is the set of fixed points of some 
other automorphism of Gc. 

Now let 


a(o) =Ga': Ge > Ge. (18.3.3) 

It is not difficult to see that 

e a(c) is an automorphism of Gc (as an abstract group), and 

e a(c) is holomorphic (since p~' and opo~! are holomorphic — in fact, 

they can be represented by polynomials in local coordinates). 

So a(a) € Aut(Gc). Thus, by defining a(1) to be the trivial automorphism, 
we obtain a function a: Gal(C/R) > Aut(Gc). 

Let Gal(C/R) act on Aut(Gc), by defining 


>= ayo ' for y € Aut(Ge). 


Then a(o) = yp! %, so a(o) - °a(o) = a(1) (since 0? = 1). This means that 


a is 1-cocycle of group cohomology, and therefore defines an element of the 
cohomology set H'(Gal(C/R), Aut(Gc)). In fact: 


This construction provides a one-to-one correspondence 
between H'(Gal(C/R), Aut(Gc)) and the set of R-forms of Gc 
(see Exercise 1). Thus, finding all of the R-forms of Gc amounts to calculating 
the cohomology of a Galois group, or, in other words, “Galois cohomology” 


(18.3.4) 


(18.3.5) Observation. The above discussion is an example of a fairly gen- 
eral principle: if X is an algebraic object that is defined over R, then 
H'(Gal(C/R), Aut(Xc)) is in one-to-one correspondence with the set of R- 
isomorphism classes of R-defined objects whose C-points are isomorphic 
to Xc. 


(18.3.6) Example. Suppose V; and V2 are two vector spaces over R, and 
they are isomorphic over C. (Le., Vi @ C & V2 ®C.) Then the two vector 
spaces have the same dimension, so elementary linear algebra tells us that they 
are isomorphic over R. This means that the R-form of any complex vector 
space Vc is unique (up to isomorphism), so the general principle (18.3.5) tells 
us 


H'(Gal(C/R), Aut(Vc)) = 0. 
In other words, we have 
H'(Gal(C/R), GL(n,C)) = 0. 
A similar argument shows H'(Gal(C/R), SL(n, C)) = 0 (see Exercise 2). 


(18.3.7) Warning. The “fairly general principle” (18.3.5) is not completely 
general. Although almost nothing needs to be assumed in order to construct 
a well-defined, injective map from the set of Q-forms to the cohomology set 
(cf. Exercise 1), this map might not be surjective. That is, there might be 
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cohomology classes that do not come from Q-forms, unless some (fairly mild) 
hypotheses are imposed on the class of algebraic objects. 


(18.3.8) Warning. Up to now, we have usually ignored finite groups in this 
book: an answer up to isogeny or commensurability was good enough. How- 
ever, such sloppiness is unacceptable when calculating Galois cohomology 
groups. For example, even though SL(n,C) is isogenous to PSL(n,C), the 
two groups have completely different cohomology. Namely: 


e we saw in Example 18.3.6 that H'(Gal(C/R), SL(n, C)) is trivial, but 
e Subsection 18.3(iii) will show H'(Gal(C/R), PSL(n, C)) is infinite. 


§18.3(iii). Constructing explicit R-forms from cohomology classes. 
Given a € H'(Gal(C/R), Aut(Gc)), we will now see how to find the corre- 
sponding R-form Gp. 

It is known that the outer automorphism group of Gc = SL(n,C) has 
only one nontrivial element, namely, the “transpose-inverse” automorphism, 


defined by w(g) = (g7)~!. So 
Aut(Gc) = PSL(n, C) x (w). 
We consider two cases. 


Case 1. Assume a € H'(Gal(C/R), PSL(n,C)). It is a fundamental fact in 
the theory of finite-dimensional algebras that every C-linear automorphism 
of the matrix algebra Matny»(C) is inner (see Exercise 5). Since the center 
acts trivially, this means Aut(Matn,n(C)) = PSL(n,C). Therefore, 
H'(Gal(C/R), PSL(n, C)) = H'(Gal(C/R), Aut (Matnxn(C))), 

so, by the general principle (18.3.5), we can identify this cohomology set with 
the set of R-forms of Mat, x,(C). More precisely, it is the set of algebras A 
over R, such that A®C = Matyx,(C). Such an algebra must be simple (since 
Matnxn(C) is simple), so, by Wedderburn’s Theorem (6.8.5), it is a matrix 
algebra over a division algebra: A = Mat,(D), where D is a division algebra 
over R. The corresponding R-form Gp is SL(k, D). It is well known that the 
only division algebras over R are R, C, and H (see Exercise 6), so the real 
form must be either SL(k, R), SL(k,C), or SL(k,H), all of which are on the 
list in Proposition 18.1.6(A). 


Case 2. Assume the image of a is not contained in PSL(n,C). In this case, 
we have a(o) = (conjugation by A)w for some A € GL(n,R). Hence, for 
every g © Gr, 


9 = (9) = (a(o) o) (9) = Aw(o(g)) A“* = A(("9)7) A, 
which means g A (%q)? = A. In other words, g is in the unitary group SU(A, a) 
corresponding to the Hermitian form on C” that is defined by the matrix A. 
Since every Hermitian form on C” is determined (up to isometry) by the 


number of positive and negative eigenvalues of A, we conclude that Gr = 
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law) 


SU(m, n) for some m and n. So Gp is listed in Proposition 18.1.6(A). 


Exercises for §18.3. 


wet 


#2. 


#3. 


4A, 


#5. 


Suppose 91(Gc)p and p2(Gc)pg are two R-forms of SL(n, C), with cor- 
responding l-cocycles a; and ag. 

a) Show that if p1(Gc)r & po(Gc)r, then a, and a2 are cohomolo- 
gous. (So the correspondence in (18.3.4) is well-defined.) 

b) Conversely, show that if a; is cohomologous to a2, then we have 
pi(Gc)r & p2(Gc)r. (So the correspondence in (18.3.4) is one-to- 
one.) 

In Subsection 18.3(iii), a real form of SL(n,C) is constructed for each 
cohomology class a. This shows that the correspondence is onto, and 
therefore completes the proof of (18.3.4). 


[Hint: In (a), you may assume, without proof, that every isomorphism from 
pi(Gc)r to p2(Gc)r extends to an isomorphism from pi(Gc) to p2(Gc).| 
Show H*(Gal(C/R),SL(n,C)) = 0, by identifying SL(n,C) with the 
automorphism group of a pair (V,€), where V is an n-dimensional vector 
space and € is a nonzero element of the exterior power A" V. 


The short exact sequence 
det, 


1 SL(n, C) @ GL(n, C) —$ C* > 1 
gives rise to the following long exact sequence of cohomology: 
H(Gal(C/R), GL(n, C)) + H(Gal(C/R), C*) 
— H'(Gal(C/R), SL(n, C)) + H'(Gal(C/R), GL(n, C)). 
Show that the first map in this sequence is surjective, and combine 
this with the vanishing of the last term to provide another proof that 
H'(Gal(C/R), SL(n,C)) = 0. 
[Hint: The Oth cohomology group is the set of fixed points of the action.| 


Show that if n is odd, then every R-form of SO(n,C) is isogenous to 
SO(p,q), for some p and q. 
[Hint: You may assume, without proof, that every automorphism of SO(n, C) 


is inner. Also note that SO(n, C) = PSO(n,C) (why?). Both of these obser- 
vations require the assumption that n is odd.| 


Show that if a is any C-linear automorphism of the ring Mat, .»(C), 
then there exists T € GL(n,C), such that p(X) = TXT! for all 
X € Matnxn(C). 

(Hint: For A = Matnxn(C), make C” into a simple A-module via a * v = 
a(a)v. However, the usual action on C” is the unique simple A-module (up to 


isomorphism), because A is a direct sum of submodules that are isomorphic 
to C”.] 
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#6. Show: 
a) C is the only finite field extension of R (other than R itself). 
b) His the only division algebra over R that is not commutative. 


[Hint: (a) You may assume, without proof, that C is algebraically closed. 
This implies that every irreducible real polynomial is either linear or qua- 
dratic. (b) If c € D\ R, then R{z] is a field extension of R; identify it 
with C. Then conjugation by 72 is a C-linear map on D. Choose j to be in 
the —1-eigenspace, and let b = j?. Show b € R and D & H, ”?.] 


§18.4. The Q-forms of SL(n, R) 


To illustrate how the method of the preceding section is used to find Q-forms, 
instead of R-forms, we prove the following result that justifies the claims made 
in Chapter 6 about arithmetic subgroups of SL(n, R): 


(18.4.1) Theorem (cf. Section 6.8). Every Q-form Gg of SL(n,R) is either 
a special linear group or a unitary group (perhaps over a division algebra). 


(18.4.2) Remark. More precisely, Gg is isomorphic to either: 
1) SL(m, D), for some m and some division algebra D over Q, or 
2) SU(A,7;D) = {g € SL(k, D) | gA(g)* = A}, where 
e D is a division algebra over Q, 
e 7 is an anti-involution of D that acts nontrivially on the center 
of D, and 
e Aisa matrix in Matz.,(D) that is Hermitian (i.e., (7A)? = A). 


The proof is based on the following connection with Galois cohomology. 
We will work with Gc, instead of G, because algebraically closed fields are 
much more amenable to Galois Theory. (That is, we are replacing SL(n, R) 
with SL(n,C) to avoid technical issues. ) 


(18.4.3) Proposition. There is a one-to-one correspondence between the Q- 
forms of Gc and the Galois cohomology set H'(Gal(C/Q), Aut(Gc)). 


Proof. We assume familiarity with the proof in Section 18.3, and highlight 
the changes that need to be made. 

Suppose we have an embedding p: Gc — SL(N,C), such that p(Gc) 
is defined over Q. The main difference from Section 18.3 is that, unlike 
Gal(C/R), the Galois group Gal(C/Q) has infinitely many nontrivial elements, 
and we need to consider all of them: since 


Q={zEC]| o(z) =z, Vo € Gal(C/Q)}, 


we have 


p(Gc)o = {9 € pc(Gc) | o(g) = 9, Vo € Gal(C/Q) }. 
For each o € Gal(C/Q), let 
=p 'op:Gc>Gc and a(o)=G07': Ge > Ge. 
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Then 


Geg={g9 € Gc|a(g) = 9, Vo € Gal(C/Q) }, 


and a(a) € Aut(Gc). Furthermore, since a(o) = p-'apa~! = p-'% is 


formally a 1-coboundary, it is easily seen to be a 1-cocycle, and therefore 
represents a cohomology class in H'(Gal(C/Q), Aut(Gc)). 

This defines the desired map from the set of Q-forms to the Galois coho- 
mology set. It can be proved to be well-defined and injective by replacing R 
with Q in Exercise 18.3#1. That the map is surjective will be established in 
the proof of Theorem 18.4.1 below, where we explicitly describe the Q-form 
corresponding to each cohomology class. 


More generally, we have the following natural analogue of Observa- 
tion 18.3.5: 


(18.4.4) Observation. If X is an algebraic object that is defined over Q (and 
satisfies mild hypotheses; cf. Warning 18.3.7), then the Galois cohomology 
set H'(Gal(C/Q), Aut(Xc)) is in one-to-one correspondence with the set of 
Q-isomorphism classes of Q-defined objects whose C-points are isomorphic 
to Xc. 


(18.4.5) Corollary (cf. Example 18.3.6). 
H'(Gal(C/Q), GL(n,C)) =0 and H*(Gal(C/Q), SL(n,C)) = 0. 


Proof of Theorem 18.4.1. Let Gc = SL(n,C). As in Subsection 18.3(iii), 
we have 
Aut(Gc) = PSL(n,C) x (w), 


where w(g) = (g7)~!. Given a € H'(Gal(C/Q), Aut(Gc)), corresponding to 
a Q-form Gg, we consider two cases. 


Case 1. Assume a € H'(Gal(C/Q), PSL(n, C)). By arguing exactly as in 
Case 1 of Subsection 18.3(iii) (but with Q in the place of R), we see that 
Ga = SL(k, D), for some k and some division algebra D over Q. 


Case 2. Assume the image of a is not contained in PSL(n,C). Since the 
outer automorphism group Out(Gc) is of order 2, it has no nontrivial auto- 
morphisms. Therefore, the action of the Galois group Gal(C/Q) on Out(Gc) 
must be trivial. Hence, if we let @: Gal(C/Q) — Out(Gc) be the 1-cocycle 
obtained from a by modding out PSL(n,C), then @ is an actual homomor- 
phism (not merely a “crossed homomorphism” ). 

By the assumption of this case (and the fact that |Out(Gc)| = 2), the 
kernel of @ is a subgroup of index 2 in Gal(C/Q). This means that the fixed 
field of ker @ is a quadratic extension L = Q [V7] of Q. Then, by construction, 
we have Gal(C/L) = ker @. 

For any o € Gal(C/ZL), the conclusion of the preceding paragraph tells 
us that @(c) is trivial. For simplicity, let us assume that the bar can be 
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removed, so a(c) is trivial (see Correction 18.4.6(1)). Since, by definition, 
we have a(o) = ga! (see (18.3.3)), this implies o = ¢. Therefore, for any 
g € Gg, we have g? = g® = g. Since this holds for all o € Gal(C/L), we 
conclude that g € SL(n, L). 

Now, for the unique nontrivial t € Gal(L/Q), we have rt ¢ kera@, so 
a(r) = (conj by A)w for some A € GL(n,R). Hence, for any g € Gg, we 
have 

g =F (9) = (a(r)7)(g) = Aw(7(g)) A = A((9)") A, 
so g A("g)’ = A, which means g € SU(A,7;L). Furthermore, the equation 
7? = 1 provides an equation that can be used to show A is Hermitian (or, 
more precisely, can be chosen to be Hermitian) (see Exercise 1). i) 


(18.4.6) Corrections. 


1) Mixed case. We seem to have shown that all Q-forms of SL(n, R) can 
be constructed from either division algebras (Case 1) or unitary groups 
(Case 2). However, the discussion in Case 2 assumes that a(c) is triv- 
ial, when all we actually know is that @(c) is trivial. Removing this 
assumption means that @ can map a part of the Galois group into 
PSL(n,C). In other words, in addition to the homomorphism @, there 
is a nontrivial cocycle from Gal(C/L) to PSL(n,C). By the argument 
of Case 1, this cocycle yields a division algebra D over L. The resulting 
Qform Gg = SU(A,7;D) is obtained by combining division algebras 
with unitary groups. 

2) C vs. Q. We should really be using the algebraic closure Q of Q, instead 
of C. The Galois cohomology set H'(Gal(Q/Q), Aut(Gg)) is defined to 
be the natural limit of the sets H'(Gal(F/Q), Aut(Gg)), where F' ranges 
over all finite Galois extensions of Q. 


Exercises for §18.4. 


#1. In Case 2 of the proof of Theorem 18.4.1, show that the matrix A can 
be chosen to be Hermitian. 


[Hint: A must be a scalar multiple \ of a Hermitian matrix (since T? = 1). 
Use the fact that H'(Gal(C/L);C*) is trivial (why?) to replace A with a 
scalar multiple of itself that makes \ = 1.] 


§18.5. Q-forms of classical groups 


By arguments similar to the ones applied to SL(n, R) in Section 18.4, it can be 
shown that the Q-forms of almost any classical group come from special linear 
groups, unitary groups, orthogonal groups, or symplectic groups. However, 
the special linear groups and unitary groups may involve division algebras, 
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and restriction of scalars (5.5.8) implies that the groups may be over an exten- 
sion F' of Q. (Recall that unitary groups over division algebras were defined 
in Definition 6.8.13, and the involutions 7, and 7, on the quaternion algebra 
H a were defined in Example 6.8.12.) Here is a list of the groups that arise: 


(18.5.1) Definition. For any algebraic number field F’, and any n, the fol- 
lowing groups are said to be of classical type: 
1) SL(n, D), where D is a division algebra whose center is F’. 
) Sp(2n, F). 
3) SO(A; F’), where A is an invertible, symmetric matrix in Mat, y,(F’). 
) SU(A, 7; H®”), where H%” is a quaternion division algebra over F, and 
A is an invertible, 7--Hermitian matrix in Matnxn(H®’). 


5) SU(A, 7; H%”), where H%2” is a quaternion division algebra, and A is 


: : ae bon te a 
an invertible, 7,-Hermitian matrix in Mat,.,,(H%’). 


6) SU(A,7; D), where 
e D is a division algebra whose center is a quadratic extension L 
of F, 
e 7 is an anti-involution whose restriction to L is the Galois auto- 
morphism of L over F’, and 
e A is an invertible, r-Hermitian matrix in Mat,.»(D). 


(18.5.2) Remark. Definition 18.5.1 is directly analogous to the list of classical 
simple Lie groups (see Examples A2.3 and A2.4). Specifically: 


1) SL(n, D) is the analogue of SL(n, R), SL(n, C), and SL(n, 


MN 


2) Sp(2n, F’) is the analogue of Sp(2n, R) and Sp(2n, C). 
3) SO(A; F) is the analogue of SO(m,n) and SO(n, C). 
4) SU(A, 7; H ua is the analogue of Sp(m,n). 

5) SU(A,7,;H®”) is the analogue of SO(n, H). 


6) SU(A,7; D) (with 7 nontrivial on the center) is the analogue of SU(m, n). 


(18.5.3) Theorem. Suppose 

e G is classical, and 

e no simple factor of Gc is isogenous to SO(8, C). 
Then every irreducible, arithmetic lattice in G is commensurable to the integer 
points of some group (of classical type) listed in Definition 18.5.1. 


(18.5.4) Remark. To state the conclusion of Theorem 18.5.3 more explicitly, 
let us assume, for simplicity, that the center of G is trivial. Then Theo- 
rem 18.5.3 states that there exist: 
e algebraic number field F’, with places S°° and ring of integers O, 
e a group G F listed in Definition 18.5.1, with corresponding semisimple 
Lie group G that is defined over F , and 
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e a homomorphism y: || Gres G, with compact kernel, 


aEse 
such that y(A(Go)) is commensurable to I (cf. Proposition 5.5.8). 


(18.5.5) Warning. Although y(A(Go)) is commensurable to T, this does 
not imply that y(A(G F)) is commensurable to Gg. For example, the image 
of SL(2, Q) in PSL(2, Q) has infinite index (cf. Exercise 5.2#1). 


Each of the groups in Definition 18.5.1 has a corresponding semisimple 
Lie group G that is defined over F’. Before determining which Lie group 
corresponds to each F’-group, we first find the complexification of G. This is 
similar to calculations that we have already seen, so we omit the details. 


(18.5.6) Proposition (cf. Section 18.2). The notation of each part of this 
proposition is taken from the corresponding part of Definition 18.5.1. We use 
d to denote the degree of the central division algebra D, and the matrix A is 
assumed to ben xX n. 


1) SL(n, D @p C) = SL(dn, C). 

2) Sp(2n, C) = Sp(2n, C) (obviously!). 
3) SO(A;C) = SO(n, C). 

4) SU(A, t-; Hf? @r C) & Sp(2n,C). 

5) SU(A, 7; H®” @p C) & SO(2n, C). 
6) SU(A,7; D @r C) = SL(dn, C). 


If F ¢ R, then the semisimple Lie group G corresponding to Gr is the 
complex Lie group in the corresponding line of the above proposition. How- 
ever, if F C R, then G is some R-form of that complex group. The following 
result lists the correct R-form for each of the groups of classical type. 


(18.5.7) Proposition. The notation of each part of this proposition is taken 
from the corresponding part of Definition 18.5.1. We use d to denote the 
degree of the central division algebra D, and the matrix A is assumed to be 
nxn. 
Assume F is an algebraic number field, and that F C R. Then: 
ae SL(dn, R) if D@pr RS Matgya(R), 
)= SL(dn/2,H) otherwise. 


2) Sp(2n, R) = Sp(2n, R) (obviously!). 


1) SL(n, D®@Fr 


2n, IR if HS } 
4) SU(A, To; He? ®@rR)& Sp(2n,R) if [ 
Sp(p,n—p) if Hp 


SO(p,2n—p) if HY? Y Mato,2(R), 
SO(n, H) if He? = H. 


5) SU(A, 7,;H% @p R) & 
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if L CR and 
DSrRE Mataxa( R), 


SL(dn/2,H) otherwise. 


6) SU(A,7;D @p R) & ¢ SL(dn,R) 


(18.5.8) Remark. Proposition 18.5.7 does not specify the value of p, where 
it appears. However, it can be calculated for any particular matrix A. For 
example, to calculate p in (6), note that, because L ¢ R, we have 


D®rFREDE,CE Mataxa(C), 
so we may think of A € Matnxn(D) as a (dn) x (dn) Hermitian matrix. Then 
p is the number of positive eigenvalues of this Hermitian matrix (and dn — p 


is the number of negative eigenvalues). We have already seen this type of 
consideration in Notation 6.4.10 and Proposition 6.4.11. 


(18.5.9) Remark. The table on page 380 summarizes the above results in a 
format that makes it easy to find the arithmetic subgroups of any given simple 
Lie group G (or, by restriction of scalars, to find the irreducible arithmetic 
subgroups of any semisimple Lie group that has G as a simple factor), ex- 
cept that (as indicated by “?”) the list is not complete for groups whose 
complexification is isogenous to SO(8, C). 

The arithmetic group I that corresponds to a given F-form G'r is obtained 
by: 


e replacing F' with its ring of integers O, or 
e replacing D with an order Op (see Lemma 6.8.7). 


By restriction of scalars (5.5.8), [is an arithmetic subgroup of [[,¢g0 G?. 

A parenthetical reference indicates the corresponding part of Defini- 
tion 18.5.1, and also of Proposition 18.5.6 (for F, = C) and Proposition 18.5.7 
(for F, = R). The reference column (combined with the “m or p+q” column) 
also lists additional conditions that determine Gr @ pF is the desired simple 
Lie group G (except that the parameters p and q will need to be calculated, 
if they arise). 

The Q-rank of the corresponding arithmetic group I is either given explic- 
itly (as a function of 7), or is the dimension of a maximal isotropic subspace 
(of the associated vector space over either the field F' or the division algebra D, 
as indicated). 


(18.5.10) Remark (“triality”). Perhaps we should explain why the statement 
of Theorem 18.5.3 assumes no simple factor of Gc is isogenous to SO(8, C). 
Fundamentally, the reason PSO(8,C) is special is that, unlike all the other 
simple Lie groups over C, it has an outer automorphism @ of order 3, called 
“triality” For all of the other simple groups, the outer automorphism group 
is either trivial or has order 2. 
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Lie group G | F-form Gr reference m or p+q rankg [ 
SL(m, R) SL(n, D) (1), PCR, m =dn n—-1 
D split/R 
SU(B,7; D) (6), FCLCR,| m=adn D-subspace 
D split/R 
SL(m, C) SL(n, D) (1), F ZR m=dn n—-1 
SU(B,7; D) (6), F ZR m=dn_ | D-subspace 
(so L ¢ R) 
SL(m,H) | SL(n,D) (1), FCR, | m=dn/2,| n-1 
D not split/R d even 
SU(B,7; D) (6), FCLCR,| ma dn/2, | D-subspace 
D not split/R d even 
SU(p, q) SU(B,7; D) (6), F CR, p+qz=dn | D-subspace 
LZR 
SO(p, ¢) SO(B; F’) (3), GIR p+q=n | F-subspace 
SU(B,7,;D (5), F CR, + q = 2n, | D-subspace 
| “Dspiye | PTE 
? Remark 185.10 | p+q= ? 
SO(m, C) SO(B; F) (3), F ZR nan F’-subspace 
SU(B,7;; D) (5),F ZR m=2n, | D-subspace 
d — 
? Remark 18.5.10 n= ? 
SO(m,H) | SU(B,7,;D) (5), F CR, m=n,_ | D-subspace 
D not split/R d-2 
? Remark 18.5.10 m=A4 ? 
Sp(2m, R) Sp(2n, F’) (2), F CR m=n n 
SU(B, 7; D) (4), F CR, m=n, | D-subspace 
D split/R B29 
Sp(2m, C) Sp(2n, F’) (2),F ZR =n n 
SU(B,7.; D) (4),F ZR m=n,_ | D-subspace 
d=? 
Sp(p, SU(B,7,.;D (4), F CR, +q=n, | D-subspace 
PtP. 4) ( D not split/R fa 


See Remark 18.5.9 on page 379 for an explanation of this table. 
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Here is how the triality automorphism ¢@ can be used to construct Q- 
forms that are not listed in Theorem 18.5.3. We first choose any homomor- 
phism a: Gal(C/Q) 2°“ (6) (so the kernel of a is a cubic, Galois extension 
of Q). The triality automorphism can be chosen so that it commutes with 
the action of the Galois group (in other words, ¢ is “defined over Q”), so 
the homomorphism a is a 1-cocycle into Aut (PSO(8, C)). Therefore, by the 
correspondence between cohomology and Q-forms (18.4.3), there is a corre- 
sponding Q-form Gg. This Q-form is not any of the groups listed in Theo- 
rem 18.5.3, because, for all those groups, the image of the induced homomor- 
phism @: Gal(C/Q) — Out(Gc) has order 1 or 2, not 3. 

Mathematicians who understand the triality automorphism can construct 
the corresponding Q-form explicitly, by reversing the steps in the proof of 
Proposition 18.4.3. Namely, for each a € Gal(C/Q), let 


Gg =a(o)-o € Aut(PSO(8, C)). 
Then 
Go = {g € PSO(8, C) | o(g) = 9, Vo € Gal(C/Q) }. 


§18.6. Applications of the classification of arithmetic groups 


Several results that were stated without proof in previous chapters are easy 
consequences of the above classification of F'-forms. 


(18.6.1) Corollary (cf. Proposition 6.4.5). Suppose [ is an arithmetic sub- 
group of SO(m,n), andm-+n > 5 is odd. Then there is a finite extension F 
of Q, with ring of integers O, such that T is commensurable to SO(A;O), 
for some invertible, symmetric matriz A in Matnxn(F). 


Proof. Let G = SO(m,n). Restriction of scalars (5.5.16) implies there is 
a group G that is defined over an algebraic number field F’ and has a sim- 
ple factor that is isogenous to G, such that I is commensurable to Ce By 
inspection, we see that a group of the form SO(m, n) never appears in Propo- 
sition 18.5.6, and appears at two places in Proposition 18.5.7. However, in 
our situation, we know that m+n is odd, so the only possibility for @ F is 
SO(A; F’). Therefore, [ is commensurable to SO(A; 0). 


(18.6.2) Corollary (cf. Remark 9.1.7(3b)). If G = SO(2,n), with n > 5, 
and n is odd, then rankpG = 2, but there is no lattice T in G, such that 
rankg T= 1. 


Proof. We have rankg SO(2,n) = min{2,n} = 2 (see Proposition 8.1.8). 
From the Margulis Arithmeticity Theorem (5.2.1), we know that I is 
arithmetic, so Corollary 18.6.1 tells us that T is of the form SO(B; O), where 


e © is the ring of integers of some algebraic number field F', and 


e B is asymmetric bilinear form on F"t?. 
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If rankg TP = 1, then G/T is not compact, so Corollary 5.3.2 tells us that we 
may take F’ = Q; therefore O = Z. We see that: 


1) B has signature (2,n) on Rt? (because G = SO(2,n)), and 


2) no 2-dimensional Q-subspace of Q”*? is totally isotropic (because we 
have rankgT < 2). 


Recall the following important fact that was used in the proof of Proposi- 
tion 6.4.1: 


Meyer’s Theorem. /f Bo(z,y) is any nondegenerate, symmetric bi- 
linear form on R¢, such that 
o B is defined over Q, 


o d>5, and 
o Bo is isotropic over R (that is, B(v,v) = 0 for some nonzero 
v € RY), 
then Bo is also isotropic over Q (that is, B(v,v) =0 for some nonzero 
v € Q*). 


This theorem, tells us there is a nontrivial isotropic vector v € Q”*?. Then, 
because B is nondegenerate, there is a vector w € Q"*?, such that B(v,w) = 1 
and B(w,w) = 0. Let V be the R-span of {v,w}. Because the restriction 
of B to V is nondegenerate, we have R°+? = V @ V+. This direct sum is 
obviously orthogonal (with respect to B), and the restriction of B to V has 
signature (1,1), so we conclude that the restriction of B to V+ has signature 
(1,n —1). Hence, there is an isotropic vector in V+. By applying Meyer’s 
Theorem again, we conclude that there is an isotropic vector z in (V+)g. 
Then (v,z) is a 2-dimensional totally isotropic subspace of Q"t?. This is a 
contradiction. Oo 


(18.6.3) Corollary (cf. Proposition 6.4.2). If n ¢ {3,7}, then every nonco- 
compact, arithmetic subgroup of SO(1,n) is commensurable to a conjugate of 
SO(A;Z), for some invertible, symmetric matriz A € Matnxn(Q). 


Proof. Assume, for simplicity, that n = 5, and let T be a noncocompact, 
arithmetic subgroup of G = SO(1,5). Since [ is not cocompact, there is 
no need for restriction of scalars: I’ corresponds to a Q-form Gg on G itself 
(see Corollary 5.3.2). We may assume the Q-form is not SO(A;Q); other- 
wise, I’ is as described. Therefore, by inspection of Proposition 18.5.6 and 
Proposition 18.5.7, we see that Gg must be of the form SU(A, 7;; Hg”), where 


A € Matsx3(Hg’), and Hg” @g R & Matex2(R). 


Because G/T is not compact, there is a vector v € (Hg”)°, such that 
T,(v) Av = 0 (see Exercise 18.7#4). Hence, it is not difficult to see that, by 
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making a change of basis, we may assume 


F 0 J 
A=j|0 -1 O|, for some p € Hg”. 
lo 0 | 


Since the identity matrix Id2,2 is the image of 1 € H under any isomor- 


ae il 


G = SU(A;H%” @q R) & SO(Ap;R), where Ag = | Pa | 


phism ale ®g R—- Mate 2(R), this means 


* Ok 
* Ok 


Therefore, G' is isomorphic to either SO(2,4) or SO(3,3); it is not isogenous 
to SO(1,5). This is a contradiction. 


(18.6.4) Proposition (see Proposition 6.6.5). Every noncocompact, arith- 
metic subgroup of SL(3,R) is commensurable to a conjugate of either SL(3, Z) 
or a subgroup of the form SU(J3,0;O0), where 


0 0 1 
edg=/0 LD DO), 
1 0 0 


e L isa real quadratic extension of Q, with Galois automorphism o, and 


e O is the ring of integers of L. 


Proof. Let I be an arithmetic subgroup of G = SL(3,R), such that G/T is 
not compact. We know, from the Margulis Arithmeticity Theorem (5.2.1), 
that [ is arithmetic. Since G/T is not compact, there is no need for restriction 
of scalars (see Corollary 5.3.2), so there is a Q-form Gg of G, such that T is 
commensurable to Gz. By inspection of Propositions 18.5.6 and 18.5.7, we see 
that there are only two possibilities for Gg. We consider them individually, 
as separate cases. 


Case 1. Assume Gg = SL(n, D), for some central division algebra D of de- 
gree d over Q, with dn = 3. Because 3 is prime, there are only two possibilities 
for n and d. 


Subcase 1.1. Assume n = 3 and d= 1. Since d = 1, we have dimg D = 1, so 
D=Q. Therefore, Gg = SL(3, Q). Sol & SL(3, Z). 
Subcase 1.2. Assume n = 1 and d = 3. We have Gg = SL(1, D). Therefore 


I = SL(1, Op) is cocompact (see Proposition 6.8.8(2)). This is a contradic- 
tion. 


Case 2. Assume Gg = SU(A,7;D), for A,D,o as in 18.5.1(6), with F =Q, 
LCR, anddn =3. If n = 1, then Gg C SL(1, D), so it has no unipotent 
elements, which contradicts the fact that G/T is not compact. Thus, we may 
assume that n = 3 andd=1. 
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Since d = 1, we have D = L, so Gg = SU(A,¢; L), where a is the (unique) 
Galois automorphism of L over Q, and B is a o-Hermitian form on L?°. 

Since I’ is not cocompact, we know rankgI > 1, so there is some nonzero 

v € L? with v? Av = 0 (cf. Example 9.1.5(4)). From this, it is not difficult to 

construct a basis of L? in which A is a scalar multiple of J3 (see Exercise 3). 

Oj 


Exercises for §18.6. 


#1. Assume that F C R, and that A is an invertible, symmetric matrix in 
Matnxn(F’). Show that if exactly p of the eigenvalues of A are positive, 
then SU(A;R) = SO(p, n — p). 


#2. Suppose 
e FCR,H ae is a quaternion division algebra over F’, and 
e Ais an invertible 7.-Hermitian matrix in Mat,,.,(D) that is diag- 
onal. 
Show: 
a) every entry of the matrix A belongs to F' (and, hence, to R), and 
b) if exactly p of the diagonal entries of A are positive, then we have 
SU(A, 7%; HY? @p R) = Sp(p,n — p). 


#3. Complete the proof of Proposition 18.6.4, by showing that we may 
assume A= J. 
[Hint: Assume v1 and v3 are isotropic, and v2 is orthogonal to both v; and v3. 
Multiply A by a scalar in Q, so v3 Ave = 1. Then normalize v3, so vj Avs = 1.] 
#4. (B. Farb) For each n > 2, find a cocompact lattice T,, in SL(n, R), such 
that Tp CT3 CTy C---. (If we did not require [,, to be cocompact, 
we could let [',, = SL(n, Z).) 


#5. Show that if G = SU(A,7,;H as as in Definition 18.5.1(5), then there 
exists A’ € Matnxn(H ao such that A’ is skew-Hermitian with respect 
to the standard anti-involution 7, and G = SU(A’,7,; HI”). 

[Hint: Use the fact that 7,(2) = j~'te(x)j.] 


§18.7. G has a cocompact lattice 


We have already seen that if G is not compact, then it has a noncocompact 
lattice (see Corollary 5.1.17). In this section, we will show there is also a 
lattice that is cocompact: 


(18.7.1) Theorem. G has a cocompact, arithmetic lattice. 


To illustrate the main idea, we briefly recall the prototypical case, which 
is a generalization of Example 5.5.4. 


(18.7.2) Proposition. SO(m,n) has a cocompact, arithmetic lattice. 
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Proof. Let 

e F=Q(v2), 

e o be the Galois automorphism of F’ over Q, 

e O=Z\y2I, 

© Bix, y) = hai t5y5 — V2 O51 Up sYp+y, for x,y © FPt, 

° G=S0(B)°, 

e [= Go, and 

e A: Gr > G x G? defined by A(g) = (9, 0(g)). 
We know (from restriction of scalars) that A(I) is an irreducible, arithmetic 
lattice in G x G? (see Proposition 5.5.8). Since G7 & SO(p+q) is compact, we 


may mod it out, to conclude that [ is an arithmetic lattice in G = SO(m,n)?°. 
Also, since G? is compact, we know that [ is cocompact (see Corollary 5.5.10). 


More generally, the same idea can be used to prove that any simple 
group G has a cocompact, arithmetic lattice. Namely, start by letting K 
be a compact group that has the same complexification as G. (For classical 
groups, the correct choice of K can be found by looking at Proposition 18.1.6.) 
Then show that G x K has an irreducible, arithmetic lattice. More precisely, 
construct 


e an extension F' of Q that has exactly two places o and 7, and 

e® a group G that is defined over F : 
such that G? and G7 are isogenous to G and K, respectively. 

Although we could do this explicitly for the classical groups, we will save 
ourselves a lot of work (and also be able to handle the exceptional groups at 
the same time) by quoting the following powerful theorem. 

(18.7.3) Theorem (Borel-Harder). Suppose 

e F is an algebraic number field, 

e S° is the set of places of F, 

e G is defined over F, 

e Gc is connected and simple, and has trivial center, and 

e for eacha € S*, we are given some F,-form Gz of Gc. 

Then there exists a group G that is defined over F’, such that Ga= G,, for 
eacha € S™. 
Remark on the proof. For any place o of F’,, there is a natural map 

o*: H'(Gal(C/F), Aut(Gc)) > H*(Gal(C/F,), Aut(Gc)). 
Namely, any element of the domain corresponds to an F'-form G of Gc. 
The twisted group G? is defined over o(F’), and is therefore also defined 
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over F,. Hence, it determines an element of the range. (The map can 
also be defined directly, in terms of 1-cocycles, by restricting a cocycle 
a: Gal(C/F) — Aut(Gc) to the subgroup Gal(C/F;,).) 

However, we should replace C with Q in the domain (see Correction 18.4.6(2)). 
Making this correction, and putting together the maps for the various choices 
of a, we obtain a map 


H'(Gal(Q/F), Aut(Gg)) > a H'(Gal(C/Fz), Aut(Gc)). 


The theorem is proved by showing that this map is surjective. L] 


(18.7.4) Corollary. If G is isotypic (see Definition 5.6.1), then G has a 
cocompact, irreducible, arithmetic lattice. 


Proof. Assume G has trivial center (by passing to an isogenous group), and 
write G = G! x --- x G”, where each G" is simple. Let G= G!, and as- 
sume, for simplicity, that Gt. is simple, for every i (see Exercise 2). Then, by 
assumption, Gt = Ge for every i, which means that G’ is an R-form of Ge. 

Let F’ be an extension of Q, such that F’ has exactly n places v1,..., Un, 
and all of them are real (see Lemma 18.7.8). The Borel-Harder Theorem 
(18.7.3) provides some group G that is defined over F, such that Gr,, = Ga 


for each i. Then restriction of scalars (5.5.8) tells us that Go is (isomorphic 
to) an irreducible, arithmetic lattice in []j_, Gr,, = G. 

We may assume that G” is compact (by replacing G with G x K fora 
compact group K, such that G x K is isotypic). Then Corollary 5.5.10 tells 
us that every irreducible, arithmetic lattice in G is cocompact. L 


Proof of Theorem 18.7.1. We may assume G is simple. (If [; and [2 are 
cocompact, arithmetic lattices in G; and Gg, then Ty x Tg is a cocompact, 
arithmetic lattice in G, x Gg.) Then G is isotypic, so Corollary 18.7.4 applies. 

O 


The converse of Corollary 18.7.4 is true (even without assuming cocom- 
pactness): 


(18.7.5) Proposition (cf. Proposition 5.5.12). If G has an irreducible, arith- 
metic lattice, then G is isotypic. 


By the Margulis Arithmeticity Theorem (5.2.1), there is usually no need 
to assume that the lattice is arithmetic: 
(18.7.6) Corollary. G is isotypic if 

e it has an irreducible lattice, and 


e it is not isogenous to a group of the form SO(1,n)x K or SU(1,n)x Kk, 
where K is compact. 
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We know that if G has an irreducible, arithmetic lattice that is not cocom- 
pact, then G is isotypic (see Proposition 18.7.5) and has no compact factors 
(see Corollary 5.5.10). However, the converse is not true: 


(18.7.7) Example. Every irreducible lattice in SO(3, H) x SO(1, 5) is cocom- 
pact. 


Proof. Suppose I is an irreducible lattice in SO(3, H) x SO(1,5), such that 
G/T is not compact. This will lead to a contradiction. 

The Margulis Arithmeticity Theorem (5.2.1) implies that [ is arithmetic, 
so Corollary 5.5.15 implies that [ can be obtained by restriction of scalars. 
Hence, there exist: 


e an algebraic number field F' with exactly two places 1 and o, and 
e a (connected) group G that is defined over F, 

such that 
e G is isogenous to SO(3, H), 
e G” is isogenous to SO(1,5), and 
e I’ is commensurable to A(Go) in G x G. 


Since SO(n, H) occurs only once in Propositions 18.5.6 and 18.5.7 combined, 
Gr must be of the form SU(A,7,; H ae for some A € Mat3x3(E an 
However, since G/T is not compact, the proof of Corollary 18.6.3 implies 
that G? is isomorphic to either SO(2,4) or SO(3,3); it is not isogenous to 
SO(1,5). This is a contradiction. 


We close with a stronger version of a fact that was used in the proof of 
Corollary 18.7.4: 


(18.7.8) Lemma. For any natural numbers r and s, not both 0, there is an 
algebraic number field F' with exactly r real places and s complex places. 


Proof. Let n = r+2s. It suffices to find an irreducible polynomial f(x) € Q|z] 
of degree n, such that f(x) has exactly r real roots. (Then we may let 
F = Qa), where a is any root of f(z).) 

Choose a monic polynomial g(x) € Z[z], such that 


e g(x) has degree n, 

e g(x) has exactly r real roots, and 
e all of the real roots of g(x) are simple. 

For example, choose distinct integers a1,...,a,, and let 
g(x) = (e — a1) +++ (w — ay) (a** +1). 
Fix a prime p. Exercise 5 allows us to assume 

1) g(x) =x” (mod p’), and 

2) min{ g(t) | 9'(t) = 0} >p, 
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by replacing g(x) with k”g(a/k), for an appropriate integer k. 

Let f(x) = g(x) — p. From (1), we know that f(z) = 2" — p (mod p?), 
so the Eisenstein Criterion (B4.6) implies that f is irreducible. From (2), 
we know that f(a) has the same number of real roots as g(x) (see the figure 
below). Therefore f(a) has exactly r real roots. O 


y= g(x) 


Exercises for §18.7. 


ant, 


#2. 


#3. 


4A, 


#5. 


Use restriction of scalars to construct cocompact arithmetic subgroups 
of SU(m,n) and Sp(m,n) for all m and n. 
[Hint: See the proof of Proposition 18.7.2.] 


The proof of Corollary 18.7.4 assumes that Gi. is simple for every i. 
Remove this assumption. 


[Hint: You may use Remark 18.2.2 (without proof), and you will need the 
full strength of Lemma 18.7.8.| 


Construct a noncocompact, irreducible lattice in SL(2,R) x SO(3). 


[Hint: The free group F»2 is a noncocompact lattice in SL(2,R). Let [ be the 
graph of a homomorphism F2 + SO(3) that has dense image.| 


In the proof of Example 18.7.7, show there exists v € (He), such that 
Tr(v)? Av = 0. 

(Hint: If g is a nontrivial unipotent element of Gr, then there is some w, 
such that g fixes the nonzero vector v = gu — w.| 


a) Suppose g(a) is a monic polynomial of degree n, and assume k € 
Z~ {0}, such that k g(x) € Z[z|. Show k"g(x/k) € Za]. 

b) Suppose g(x) is a monic, integral polynomial of degree n, and p is 
a prime. Show that p?"g(x/p”) = x” (mod p”). 

c) Suppose g(a) and h(x) are monic polynomials, and k and n are 
nonzero integers, such that h(x) = k"g(a/k). Show that 


min{ |h(t)| | h’(t) =0} =k" min{ |g(¢)| | g’(t) = 0}. 
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Notes 


A proof of the classification of complex semisimple Lie algebras (18.1.1,18.1.2) 
can be found in standard texts, such as [6, Thm. 11.4, pp. 57-58, and 
Thm. 18.4, p. 101]. 

The classification of real simple Lie algebras (Theorem 18.1.7) was ob- 
tained by E. Cartan [4]. (The intervening decades have led to enormous sim- 
plifications in the proof.) 

In Sections 18.3 and 18.4, our cohomological approach to the classifica- 
tion of F-forms of the classical groups is based on [10, §2.3], where full details 
can be found. See [12] for a list of all F-forms (mostly without proof), in- 
cluding exceptional groups (intended for readers familiar with root systems). 
The special case of real forms (including exceptional groups) is proved, by a 
somewhat different approach, in [5, Chap. 10]. 

Theorem 18.5.3 is due to Weil [13]. A proof (together with Proposi- 
tions 18.1.6 and 18.5.6) is in [10, §2.3, pp. 78-92]. We copied (18.1.6), (18.5.1), 
and (18.5.6) from [10, p. 92], except that [10] uses a different description of 
the groups in 18.5.1(5) (see Exercise 18.6745). 

Exercise 18.34£5 is an easy special case of the Skolem-Noether Theorem, 
which can be found in texts on ring theory, such as [9, §12.6, p. 230]. 

Exercise 18.346, the classification of division algebras over R, is due to 
Frobenius (1878), and a proof can be found in [7, pp. 452-453]. 

Theorem 18.7.3 is due to A. Borel and G. Harder [1]. See [8] for an explicit 
construction of G in the special case where the simple factors of G are classical. 

G. Prasad (personal communication) supplied Example 18.7.7. It is a 
counterexample to the noncocompact part of [8, Thm. C], which erroneously 
states that isotypic groups with no compact factors have both cocompact and 
noncocompact irreducible lattices. 

Remark 18.2.2 is a consequence of |[3, Prop. 1 of App. 2, p. 385], since a 
connected Lie group is simple if and only if its adjoint representation has no 
nonzero, proper, invariant subspaces. 

Meyer’s Theorem (used in the proof of Corollary 18.6.2) can be found in 
(2, Thm. 1 of §1.7 and Thm. 5 of §1.6, pp. 61 and 51] or [11, Cor. 2 of §4.3.2, 
p. 43]. 
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Chapter 19 


Construction of a Coarse 
Fundamental Domain 


The ordinary 2-torus is often depicted as a square with opposite sides iden- 
tified, and it would be useful to have a similar representation of [\G, so we 
would like to construct a fundamental domain for [in G. Unfortunately, it is 
usually not feasible to do this explicitly, so, as in Chapter 7, where we showed 
that SL(n, Z) is a lattice in SL(n, R), we will make do with a nice set that is 
close to being a fundamental domain: 


(19.0.1) Definition (cf. Definition 4.7.2). A subset F of G is called a coarse 
fundamental domain for T in G if 

1) TF =G, and 

2) {yET| FN VF FO} is finite. 


The main result is Theorem 19.2.2, which states that the desired set F 
can be constructed as a finite union of (translates of) “Siegel sets” in G. 
Applications of the construction are described in Section 19.3. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: Q-rank (Chapter 9). Recommended: 
Siegel sets for SL(n, Z) (sections 7.1 to 7.3). 
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§19.1. What is a Siegel set? 


Before defining Siegel sets in every semisimple group, we recall the following 
special case: 


(19.1.1) Definition (cf. Definition 7.2.4). A Siegel set for SL(n, Z) is a set 
of the form 84, = NA. K C SL(n,R), where 


e N is a compact subset of the group N of upper-triangular unipotent 
matrices, 

eA. = {a EA | Qi-1i—-1 2 CQi,i fori =1,...,n— 1}, where A is the 
group of positive-definite diagonal matrices (and c > 0), and 


e kK =SO(n). 


In this section, we generalize this notion by replacing SL(n, Z) with any 
arithmetic subgroup (or, more generally, any lattice) in any semisimple Lie 
group G. To this end, note that the subgroups NV, A, and K above are the 
components of the Iwasawa decomposition G = KAN (or G= NAK), which 
can be defined for any semisimple group (see Theorem 8.4.9): 


e N is a maximal unipotent subgroup of G. 


e Ais a maximal R-split torus of G that normalizes N, and 
e K is a maximal compact subgroup of G. 


Now, to construct Siegel sets in the general case, we will do two things. 
First, we rephrase Definition 19.1.1 in a way that does not refer to any specific 
realization of G as a matrix group. To this end, recall that, for G = SL(n, R), 
the positive Weyl chamber is 


At= {aeEA | Qi > Ai+1,i4+1 for gS less. wl 
Therefore, in the notation of Definition 19.1.1, we have At = Aj, and, for 


any c > 0, it is not difficult to see that there exists some a € A, such that 
A, = aA? (see Exercise 7.2#11). Therefore, letting C = Na, we see that 


SH = CAtK, and C is a compact subset of NA. 


This description of Siegel sets can be generalized in a natural way to any 
semisimple group G. 

However, all of the above is based entirely on the structure of G, with no 
mention of I, but a coarse fundamental domain F needs to be constructed 
with a particular arithmetic subgroup Tin mind. For example, if [\G is 
compact (or, in other words, if rankgI = 0), then our coarse fundamental 
domain needs to be compact, so none of the factors in the definition of a 
Siegel set can be unbounded. Therefore, we need to replace the maximal R- 
split torus A with a smaller torus S that reflects the choice of a particular 
subgroup [. In fact, S will be the trivial torus when G/T is compact. In 
general, S is a maximal Q-split torus of G (hence, S is compact if and only if 
I'\G is compact). 
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Now, if S is properly contained in A, then N SK is not allof G. Hence, NS 
will usually not be the appropriate replacement for the subgroup N A. Instead, 
if we note that NA is the identity component of a minimal parabolic subgroup 
of SL(n, R) (see Example 8.4.4(1)), and that NA is obviously defined over Q, 
then it is natural to replace NA with a minimal parabolic Q-subgroup P of G. 

The following definition implements these considerations. 

(19.1.2) Definition. Assume 

e G is defined over Q, 

e I’ is commensurable to Gz, 

e P isa minimal parabolic Q-subgroup of G, 

e Sis a maximal Q-split torus that is contained in P, 

e S* is the positive Weyl chamber in S (with respect to P), 

e K is a maximal compact subgroup of G, and 

e Cis any nonempty, compact subset of P. 

Then 8=8G=CS'TK isa Siegel set for Tin G. 
(19.1.3) Warning. Our definition of a Siegel set is slightly more general than 
what is usually found in the literature, because other authors place some 


restrictions on the compact set C’. For example, it is often assumed that C’ 
has nonempty interior. 


Exercises for §19.1. 


#1. Show that if 8 = CSTK is a Siegel set, then there is a compact sub- 
set C’ of G, such that 8 C STC’. 


[Hint: Conjugation by any element of St centralizes MS and contracts N 
(where P = MSN is the Langlands decomposition).] 


#2. For every compact subset C' of G, show there is a Siegel set that con- 
tains C’. 
[Hint: Exercise 9.3#4.] 


§19.2. Coarse fundamental domains made from Siegel sets 


(19.2.1) Example. Let 
e G=SL(2,R), 


e S bea Siegel set that is a coarse fundamental domain for SL(2, Z) in G 
(see Figure 7.2A(b)), and 


e I be a subgroup of finite index in SL(2, Z). 
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Then S may not be a coarse fundamental domain for I, because [S may not 
be all of G. In fact, if the hyperbolic surface ['\\S§) has more than one cusp, 
then no Siegel set is a coarse fundamental domain for I. 

However, if we let F' be a set of coset representatives for I in SL(2, Z), 
then F'S is a coarse fundamental domain for [ (see Exercise 7.247(b)). 


From the above example, we see that a coarse fundamental domain can 
sometimes be the union of several translates of a Siegel set, even in cases 
where it cannot be a single Siegel set. In fact, this construction always works 
(if [ is arithmetic): 

(19.2.2) Reduction Theory for Arithmetic Groups. Jf Tis commensu- 
rable to Gz, then there exist a Siegel set 8 and a finite subset F of Gg, such 
that F = F'8 is a coarse fundamental domain for T in G. 


The proof will be given in Section 19.4. 


Although the statement of this result only applies to arithmetic lattices, 
it can be generalized to the non-arithmetic case. However, this extension 
requires a notion of Siegel sets in groups that are not defined over Q. The 
following definition reduces this problem to the case where I is irreducible. 


(19.2.3) Definition. If 8; is a Siegel set for [; in G;, for i = 1,2,...,n, then 
81 X 82 X+:: X &y, 


is a Stegel set for the lattice Ty x --- x Ty, in Gi x --- xX Gy. 


Then, by the Margulis Arithmeticity Theorem (5.2.1), all that remains is 
to define Siegel sets for lattices in SO(1,n) and SU(1,n), but we can use the 
same definition for all simple groups of real rank one: 


(19.2.4) Definition. Assume G is simple, rankg G = 1, and K is a maximal 
compact subgroup of G. 
0) If rankg I = 0, and C is any compact subset of G, then 8 = CK isa 
Siegel set in G. 
1) Assume now that rankg I = 1. Let P be a minimal parabolic subgroup 
of G, with Langlands decomposition P = MAN, such that 
IQ N is a maximal unipotent subgroup of I. (19.2.4) 
If 
e Cis any compact subset of P, and 
e A* is the positive Weyl chamber of A (with respect to P), 
then 8S=CATK isa generalized Siegel set in G. 


(19.2.5) Remark. If [ is commensurable to Gz (and G is defined over Q), then 
(19.2.4N) holds if and only if P is defined over Q (and is therefore a minimal 
parabolic Q-subgroup). 


We can now state a suitable generalization of Theorem 19.2.2: 
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(19.2.6) Theorem. If G has no compact factors, then there exist a general- 
ized Siegel set S§ and a finite subset F of G, such that F = FS is a coarse 
fundamental domain for T in G. 


The proof is essentially the same as for Theorem 19.2.2. 


Exercises for §19.2. 


#1. Without using any of the results in this chapter (other than the defi- 
nitions of “Siegel set” and “coarse fundamental domain” ), show that if 
rankg Il’ = 0, then some Siegel set is a coarse fundamental domain for T° 
in G. 


#2. Suppose F; and F»2 are coarse fundamental domains for T, and [2 in 
G, and Go, respectively. Show that F, x F2 is a coarse fundamental 
domain for [T; x Tg in Gy x Go. 


#3. Suppose N is a compact, normal subgroup of G, and let I’ be the image 
of fin G=G/N. Show that if F is a coarse fundamental domain for 
in G, then 

F={gEG|gNeF} 
is a coarse fundamental domain for [ in G. 


#4. If G is simple, rankg G = 1, and G is defined over Q, then Defini- 
tions 19.1.2 and 19.2.4 give two different definitions of the Siegel sets 
for Gz. Show that Definition 19.2.4 is more general: any Siegel set ac- 
cording to Definition 19.1.2 is also a Siegel set by the other definition. 
[Hint: Remark 19.2.5.] 


§19.3. Applications of reduction theory 


Having a coarse fundamental domain is very helpful for understanding the 
geometry and topology of [\G. Here are a few examples of this (with only 
sketches of the proofs). 


§19.3(i). [I is finitely presented. Proposition 4.7.7 tells us that if T 
has a coarse fundamental domain that is a connected, open subset of G, 
then I is finitely presented. The coarse fundamental domains constructed in 
Theorems 19.2.2 and 19.2.6 are closed, rather than open, but it is easy to deal 
with this minor technical issue: 


(19.3.1) Definition. A subset 8, of G is an open Siegel set if8, =OS*K, 
where © is a nonempty, precompact, open subset of P. 


Choose a maximal compact subgroup K of G that contains a maximal 
compact subgroup of Cg($'). Then we may let: 
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e F = FS be a coarse fundamental domain, with 8 = C St K, such that 
C C P® and F is connected (see Exercise 4), 

e © bea connected, open, precompact subset of P° that contains C, 

e 8, =OS*K be the corresponding open Siegel set, and 

0 F=FS,. 
Then F is a coarse fundamental domain for I (see Exercise 5), and F is both 
connected and open. 

This establishes Theorem 4.7.10, which stated (without proof) that I is 
finitely presented. 


§19.3(ii). Mostow Rigidity Theorem. When rankgI; = 1, G. Prasad 
constructed a quasi-isometry y: G;/K, — G2/K 2 from an isomorphism 
p: 1, — Te, by using the Siegel-set description of the coarse fundamental 
domain for [;\G;. This completed the proof of the Mostow Rigidity Theo- 
rem (15.1.2). 


§19.3(iii). Divergent torus orbits. 


(19.3.2) Definition. Let T be an R-split torus in G, and let c € G/T. We 
say the T-orbit of x is divergent if the natural map T’ — Tx is proper. 


(19.3.3) Theorem. rankgT is the maximal dimension of an R-split torus that 
has a divergent orbit on G/T. 


We start with the easy half of the proof: 


(19.3.4) Lemma (cf. Exercise 2). Assume G is defined over Q, and let S be a 
maximal Q-split torus in G (so dim S = rankg Gz). Then the S-orbit of eGz 
is divergent. 


Now, the other half: 


(19.3.5) Theorem. [fT is an R-split torus, and dimT > rankgI, then no 
T-orbit in G/T is divergent. 


Proof (assuming rankgIl = 1). Let T be a 2-dimensional, R-split torus T’ 
of G, and define 7: T > G/T, by z(t) = tI’. Suppose z is proper. (This will 
lead to a contradiction.) 

Let P be a minimal parabolic Q-subgroup of G, and let S be a maximal 
Q-split torus in P. For simplicity, let us assume that [ = Gz, and also that 
a single open Siegel set 5, = K.S~O provides a coarse fundamental domain 
for T in G. (Note that, since we are considering G/T, instead of [\\G,, we have 
reversed the order of the factors in the definition of the Siegel set, and we use 
the opposite Weyl chamber.) 

Choose a large, compact subset C' of G/T, and let Tr be a large circle 
in T that is centered at e. Since 7 is proper, we may assume T'p is so large 
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that 7(TR) is disjoint from C. Since Tp is connected, this implies 7(T) is 
contained in a connected component of the complement of C’. So there exists 
y €T, such that Tr C 8Pzy (cf. Remark 19.4.10 below). 

Let t € Tr, and assume, for simplicity, that y = e. Then t € SPz, 
and, since Tz is closed under inverses, we see that SPz also contains t~'. 
However, it is not difficult to see that conjugation by any large element of 
SPz expands the volume form on P (see Exercise 7). Since the inverse of an 
expanding element is a contracting element, not an expanding element, this 
is a contradiction. 


Theorem 19.3.3 can be restated in the following geometric terms: 


(19.3.6) Theorem (see Theorem 2.2.1). rankgIT is the largest natural num- 
ber r, such that some finite cover of the locally symmetric space [\G/K con- 
tains a closed, simply connected, r-dimensional flat. 


§19.3(iv). The large-scale geometry of locally symmetric spaces. 
If we let 7: G > [\G/K be the natural map, then it is not difficult to see 
that the restriction of 7 to any Siegel set is proper (see Exercise 6). In fact, 
with much more work (which we omit), it can be shown that the restriction 
of 7 is very close to being an isometry: 
(19.3.7) Theorem. If 8=C STK is any Siegel set, and 

tw: G—+T\G/K is the natural map, 
then there exists c € R™, such that, for all x,y € 8, we have 
d(n(x),m(y)) < d(a,y) < d(x(2x), r(y)) +e. 

This allows us to describe the precise shape of the the locally symmetric 
space associated to I’, up to quasi-isometry: 
(19.3.8) Theorem. Let 

e X =I \G/K be the locally symmetric space associated to T, and 

e r=rankgl. 
Then X is quasi-isometric to the cone on a certain (r — 1)-dimensional sim- 
plicial complex at oo. 
Idea of proof. Modulo quasi-isometry, any features of bounded size in X 
can be completely ignored. Note that: 


e Theorem 19.3.7 tells us that, up to a bounded error, S looks the same 
as its image in X. 

e There is a compact subset C’ of G, such that 8 C S*C’ (see Exer- 
cise 19.1441), so every element of 8 is within a bounded distance of ST. 
Therefore, 8 and S* are indistinguishable, up to quasi-isometry. 
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Then, since F = F’'S covers all of X, we conclude that X is quasi-isometric 
to User fS*. 

The Weyl chamber S* is a cone; more precisely, it is the cone on an (r—1)- 
simplex at oo. Therefore, up to quasi-isometry, X is the union of these finitely 
many cones, so it is the cone on some (r — 1)-dimensional simplicial complex 


at oo. 


O 


(19.3.9) Remarks. 


1) 


The same argument shows that we get the same picture if, instead of 
looking at X modulo quasi-isometry, we look at it from farther and 
farther away, as in the definition of the asymptotic cone of X in Def- 
inition 2.2.6. Therefore, the asymptotic cone of X is the cone on a 
certain (r — 1)-dimensional simplicial complex at oo. This establishes 
Theorem 2.2.8. 


For a reader familiar with “Tits buildings? the proof (and the construc- 
tion of F) shows that this simplicial complex at oo can be constructed 
by taking the Tits building of parabolic Q-subgroups of G, and modding 
out by the action of I. 


Exercises for §19.3. 


#1. 


42. 


#3. 


4A, 


#5. 


Show that [has only finitely many conjugacy classes of finite subgroups. 


[Hint: If H is a finite subgroup of I, then H% C K, for some g € G. Write 
g= yx, with y €T anda ée F. Then Hx = «- H% C Ff, so H is conjugate 
to a subset of {y ET | FNyF £O}-.] 


Let G = SL(n, R), P = SL(n, Z), and S be the group of positive-definite 
diagonal matrices. Show the S-orbit of Te is proper. 


[Hint: If s;,;/si, is large, then conjugation by s contracts a unipotent ma- 
trix y whose only off-diagonal entry is ¥:,;-.| 


Show that every open Siegel set is an open subset of G (so the termi- 
nology is consistent). 


Assume 
e K contains a maximal compact subgroup of Cq(S), 
e Cis a compact subset of P, and 
e F isa finite subset of G. 
Show there is a compact subset C, of P°, such that CStTK C C,STK 
and FC,S* K is connected. 
[Hint: Show P°(K MCp(S)) = P|] 


Show the set F constructed in Subsection 19.3(i) is indeed a coarse 
fundamental domain for [ in G. 
[Hint: Exercise 7.2#2.] 
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#6. Let 7: G > I\G be the natural map. Show that if 8 = CSTK isa 
Siegel set for [, then the restriction of 7 to S is proper. 
[Hint: Let v be a nontrivial element of NOT. If g is a large element of 8, 
then v9 ~ e.] 

#7. Show that if O is contained in a compact subset of P, then conjugation 
by any large element of K.S~O Pz expands the Haar measure on P. 


[Hint: Conjugation by any element of MUN preserves the measure, conjuga- 
tion by an element of O is bounded, and S~ centralizes SM and expands N. 
Also note that Pz = MzNz (see Exercise 19.444).| 


§19.4. Outline of the proof of reduction theory 


(19.4.1) Notation. Throughout this section, we assume 
e G is defined over Q, 
e I’ is commensurable to Gz, and 


e P is a minimal parabolic Q-subgroup of G, with Langlands decompo- 
sition P= MSN. 


In order to use Siegel sets to construct a coarse fundamental domain, a 
bit of care needs to be taken when choosing a maximal compact subgroup K. 
Before stating the precise condition, we recall that the Cartan involution 
corresponding to K is an automorphism 7 of G, such that 7? is the identity, 
and K is the set of fixed points of 7. (For example, if G = SL(n,R) and 
K = SO(n), then r(g) = (g7)~1 is the transpose-inverse of g.) 


(19.4.2) Definition. A Siegel set 8 = CStK is normal if S is invariant 
under the Cartan involution corresponding to K. 


Fix a normal Siegel set 8 = C'S*tK, and some finite F C Gg. Then, 
letting F = F'S, the proof of Theorem 19.2.2 has two parts, corresponding to 
the two conditions in the definition of coarse fundamental domain (19.0.1): 


i) Sand F can be chosen so that 1 ¥ = G (see Theorems 19.4.3 and 19.4.4), 
and 


ii) for all choices of S and F, the set {y € T | FN yF £ O} is finite 
(see Theorem 19.4.8). 


We will sketch proofs of both parts (assuming rankgI = 1). However, as a 
practical matter, the methods of proof are not as important as understand- 
ing the construction of the coarse fundamental domain as a union of Siegel 
sets (see Section 19.2), and being able to use this in applications (as in Sec- 
tion 19.3). 
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§19.4(i). Proof that [TF = G. Here is the rough idea: Fix a base 
point in T\G. A Siegel set can easily cover all of the nearby points (see Ex- 
ercise 19.142), so consider a point Ig that is far away. Godement’s Criterion 
(5.3.1) implies there is some nontrivial unipotent v € T, such that v9 & e. 
Replacing g with a different representative of the coset replaces v with a con- 
jugate element. If we assume all the maximal unipotent Q-subgroups of G are 
conjugate under I’, this implies that we may assume v € N. If we furthermore 
assume, for simplicity, that the maximal Q-split torus S is actually a maxi- 
mal R-split torus, then the Iwasawa decomposition (8.4.9) tells us G = NSK. 
The compact group K is contained in our Siegel set 8, and the subgroup N is 
contained in T'S if S is sufficiently large, so let us assume g € S. Since g con- 
tracts the element v of N, and, by definition, S* consists of the elements of $ 
that contract N, we conclude that g € S* C8. 


We now explain how to turn this outline into a proof. 

Recall that all minimal parabolic Q-subgroups of G are conjugate un- 
der Gg (see Proposition 9.3.6(1)). The following technical result from the 
algebraic theory of arithmetic groups asserts that there are only finitely many 
conjugacy classes under the much smaller group Gz. In geometric terms, it 
is a generalization of the fact that hyperbolic manifolds of finite volume have 
only finitely many cusps. 


(19.4.3) Theorem. There is a finite subset F of Gg, such that l F Po = Go. 


The finite subset F’ provided by the theorem can be used to construct the 
coarse fundamental domain F: 


(19.4.4) Theorem. Jf F is a finite subset of Gg, such that TF Pg = Gg, then 
there is a (normal) Siegel set 8 =CS*K, such that TF8 =G. 


Idea of proof (assuming rankgI <1). For simplicity, assume = Gz, and 
that F' = {e} has only one element (see Exercise 6), so 


I Po = Go. (19.4.5) 


The theorem is trivial if is cocompact (see Exercise 19.241), so let us assume 
rankgT = 1. 

From the proof of the Godement Compactness Criterion (5.3.1), we have 
a compact subset Co of G, such that, for each g € G, either g € ['Co, or there 
is a nontrivial unipotent element v of I’, such that v% & e. By choosing C 
large enough, we may assume Cp C § (see Exercise 19.142). 

Now suppose some element g of G is not in T'S. Then g ¢ ['Co, so there 
is a nontrivial unipotent element v of I, such that 


OF Ae: (19.4.6) 


From (19.4.5), we see that we may assume v € N, after multiplying g on the 
left by an element of I (see Exercise 2). 
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We have G = PK (cf. Exercise 9.3#4). Furthermore, P = MSN, and 
I’ intersects both M and N in a cocompact lattice (see Example 9.1.5(3), 
Theorem 9.3.3(3), and Exercise 9.3#6). Therefore, if we multiply g on the 
left by an element of [9 P, and ignore a bounded error, we may assume g € S 
(see Exercise 3). Then, since rankgT = 1, we have either g € S* or g-' € S$ 
(see Exercise 5). From (19.4.6), we conclude it is g that is in S*. So g € 8, 
which contradicts the fact that g ¢ TS. 


(19.4.7) Remark. The above proof overlooks a technical issue: in the Lang- 
lands decomposition P = MSN, the subgroup M may be reductive, rather 
than semisimple. However, the maximality of S implies that the central 
torus T’ of M has no R-split subtori (cf. Theorem 9.3.3(3)), so it can be 
shown that this implies T/Tz is compact. Therefore M/Mz is compact, even 
if M is not semisimple. 


§19.4(ii). Proof that S intersects only finitely many [’-translates. 
We know that I is commensurable to Gz. Therefore, if we make the minor 
assumption that G'c has trivial center, then T C Gg (see Exercise 5.244). 
Hence, the following result establishes Condition 19.0.1(2) for F = FS: 


(19.4.8) Theorem (“Siegel property”). If 
e S=CSTK is anormal Siegel set, and 
eqg€Gg, 
then {y € Gz| q8SN 78 £0} is finite. 
Proof (assuming rankgI’ < 1). The desired conclusion is obvious if 8 is 
compact, so we may assume rankgI’ = 1. To simplify matters, let [ = Gz, 
and 
assume q = € is trivial. 
The proof is by contradiction: assume 
=a", 
for some large element y of IT, and some o,0’ € 8S. Since 7¥ is large, we may 
assume @ is large (by interchanging o with o’ and replacing y with y~!, if 
necessary). Let 
u be an element of Nz of bounded size. 
Since 0 € 8 = CS*K, we may write 
o =csk withc€ C,s€St,andke K. 


Then s must be large (since K and C are compact), so conjugation by s 
performs a large contraction on N. Since u° is an element of N of bounded 
size, and K is compact, this implies that u° * e. In other words, 
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In addition, we know that uY € Gz. Since o’ € 8, we conclude that u7 € N 
(see Exercise 7). 
Now we use the assumption that rankg I = 1: since 


ur ENON’, 
Lemma 9.2.5 tells us that N = N7, so Proposition 9.3.6(3) implies 
7 €No(N) =P. 
Then, since (VN )z has finite index in Pz (see Exercise 4), we may assume 


7 € (MN)z. 
This implies that we may work inside of MN: if we choose a compact 
subset C C MN, such that CS+ C C'S, then we have 


C(KNP)NyC(KNP) #0 (19.4.9) 
(see Exercise 9). Since C(K MP) is compact (and I is discrete), we conclude 
that there are only finitely many possibilities for y. LJ 


(19.4.10) Remark. When rankgI = 1, the first part of the proof establishes 
the useful fact that there is a compact subset Cp of G, such that if y € I, and 
SN 75 EZ Co, then y € P. 


Exercises for §19.4. 


#1. Show that every Q-split torus of G is invariant under some Cartan 
involution of G. (Therefore, for any maximal Q-split torus S$, there 
exists a maximal compact subgroup K, such that the resulting Siegel 
sets CS* K are normal.) 

(Hint: If r is any Cartan involution, then there is a maximal R-split torus A, 


such that t(a) = a~' for alla € A. Any R-split torus is contained in some 
conjugate of A.] 


#2. In the proof of Theorem 19.4.4, explain why it may be assumed that 
veEN. 
[Hint: Being unipotent, v is contained in the unipotent radical of some min- 
imal parabolic Q-subgroup (see Proposition 9.3.6(2)). Since [ Po = Ga, we 
know that all minimal parabolic Q-subgroups are conjugate under I.] 

#3. In the proof of Theorem 19.4.4, complete the proof without assuming 
that g € S. 
[Hint: Write g = pk € PK. If C is large enough that MN C (('M P)C, then 
we have g € csk CT CSK, so v* ¥ e.] 

#4. Show that if P = MAN is a Langlands decomposition of a parabolic 
Q@subgroup of G, then (MN)z contains a finite-index subgroup of Pz. 
[Hint: A Q-split torus can have only finitely many integer points.] 


#5. Show that if rankgI = 1, then S = St U(ST)7}. 
[Hint: If s € S*, then s* € St for all t € Q* (and, hence, for all t € R*).] 
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7##6. Prove Theorem 19.4.4 in the case where rankg I = 1. 


[Hint: Replace the imprecise arguments of the text with rigorous statements, 
and do not assume F is a singleton. (In order to assume v € N, multiply g on 


the left by an element x of FT. Then 2I'a~'! contains a finite-index subgroup 
of [MN P.)| 


#7. Let 8S=CS*TK bea Siegel set, and let P be the minimal parabolic Q- 
subgroup corresponding to St. Show there is a neighborhood W of e 
in G, such that if y € Gz and y° € W, for some a € S, then y € unip P. 


#8. Suppose 
e 7 is the Cartan involution corresponding to the maximal compact 
subgroup K, and 
e S is a T-invariant. 
Show KN PCM CCcg(S). 
[Hint: Since Cg¢(S$) is T-invariant, the restriction of r to the semisimple part 
of M is a Cartan involution. Therefore kK MM contains a maximal compact 


subgroup of M, which is a maximal compact subgroup of P. The second 
inclusion is immediate from the definition of the Langlands decomposition.| 


#9. Establish (19.4.9). 


[Hint: SA PN>(SNP) # O (since y € P) and 8N P= CSt(KNP) = 
C(K 1 P)S* (see Exercise 8).] 


#10. Give a complete proof of Theorem 19.4.8. 


#11. Show that [ has only finitely many conjugacy classes of maximal unipo- 
tent subgroups. 


[Hint: You may assume, for simplicity, that [ = Gz is arithmetic. Use 
Proposition 9.3.6(2) and Theorem 19.4.3.] 


Notes 


The main results of this chapter were obtained for many classical groups 
by L. Siegel (see, for example, [8]), and the general results are due to A. Borel 
and Harish-Chandra [2]. 

The book of A. Borel [1] is the standard reference for this material; see 
(1, Thm. 13.1, p. 90] for the construction of a fundamental domain for Gz as 
a union of Siegel sets. The proof there does not assume G7z is a lattice, so 
this provides a proof of the fundamental fact that every arithmetic subgroup 
of G is a lattice (see Theorem 5.1.11). See [6, §4.6] for an exposition of Borel 
and Harish-Chandra’s original proof of this fact (using the Siegel set S¢, ,5,¢3 
for SL(n, Z) from Definition 7.2.4). 

Theorem 19.3.7 was conjectured by C.L. Siegel in 1959, and was proved 
by L. Ji [4, Thm. 7.6]. Another proof is in E. Leuzinger [5, Thm. B]. 

The construction of coarse fundamental domains for non-arithmetic lat- 
tices in groups of real rank one is due to Garland and Raghunathan [3]. 


404 19. CONSTRUCTION OF A COARSE FUNDAMENTAL DOMAIN 


(An exposition appears in [7, Chap. 13].) Combining this result with Theo- 
rem 19.2.2 yields Theorem 19.2.6. 

Exercise 19.341 can be found in [6, Thm. 4.3, p. 203]. 

Regarding Remark 19.4.7, see [1, Prop. 8.5, p. 55] or [6, Thm. 4.11, p. 208] 
for a proof that if T is a Q-torus that has no Q-split subtori, then T/Tz is 
compact. 
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Chapter 20 


Ratner’s Theorems 
on Unipotent Flows 


This chapter presents three theorems that strengthen and vastly generalize 
the well-known and useful observation that if V is any straight line in the 
Euclidean plane R?, then the closure of the image of L in T? is a very nice 
submanifold (see Example 20.1.1). The plane can be replaced with any Lie 
group H, and V can be any subgroup of H that is generated by unipotent 
elements. 


§20.1. Statement of Ratner’s Orbit-Closure Theorem 
(20.1.1) Example. Let 


e V be any 1-dimensional subspace of the vector space R?, 
e T? = R?/Z? be the ordinary 2-torus, 
e x €T’, and 


e 7: R? > T? be the natural covering map. 


Geometrically, V + x is a straight line in the plane, and it is classical (and 
not difficult to prove) that: 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 


Main prerequisites for this chapter: none. 
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1) If the slope of the line V + a is rational, then 7(V + 2) is closed, and 
it is homeomorphic to the circle T! (see Exercise 1). 


2) If the slope of the line V + z is irrational, then the closure of 7(V + x) 
is the entire torus T? (see Exercise 2). 


An analogous result holds in higher dimensions: if we take any vector 
subspace of R*, the closure of its image in T¢ will always be a nice submanifold 
of T’. Indeed, the closure will be a subtorus of T¢: 

(20.1.2) Example (see Exercise 3). Let 

e V bea vector subspace of R°, 

e T’ = R*/Z be the ordinary ¢-torus, 

e «eT, and 


e 7: R’ > T° be the natural covering map. 


Then the closure of 7(V + x2) in T’ is homeomorphic to a torus T* (with 
O0<k<2). 

More precisely, there is a vector subspace L of R*, such that 

e the closure of 7(V + x) is 7(L + 2), and 

e L is defined over Q (or, in other words, LN Z is a Z-lattice in L), so 

n(L +2) & L/(LNZ*) is a torus. 

The above observation about tori generalizes in a natural way to much 
more general homogeneous spaces, by replacing: 

e R° with any connected Lie group H, 

e Z° with a lattice A in H, 


e the vector subspace V of R® with any subgroup of H that is generated 
by unipotent elements, 


e x+V with the coset xV, 
e the map 7: R’ > T* with the natural covering map 7: H > H/A, and 


e the vector subspace L of R‘ with a closed subgroup L of H. 


Because it suffices for our purposes, we state only the case where H is semisim- 
ple (so we call the group G, instead of H): 


(20.1.3) Ratner’s Orbit-Closure Theorem. Suppose 
e V is a subgroup of G that is generated by unipotent elements, and 
e te€G/T. 


Then there is a closed subgroup L of G, such that the closure of Vx is Lx. 
Furthermore, L can be chosen so that: 


1) L contains the identity component of V, 

2) 

3) there is an L-invariant, finite measure on La. 
( 


Also note that La is closed in G/T, since it is the closure of Vz.) 


L has only finitely many connected components, and 
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(20.1.4) Remark. Write x = gI’, for some g € G, and let A = (gg!) NL. 


1) The theorem tells us that the closure of Vz is a very nice submanifold 
of G/T. Indeed, the closure is homeomorphic to the homogeneous space 
C/A. 

2) Conclusion (3) of the theorem is equivalent to the assertion that A is a 
lattice in L. 


(20.1.5) Warning. The assumption that V is generated by unipotent ele- 
ments cannot be eliminated. For example, it is known that if V is the group 
of diagonal matrices in G = SL(2, R), then there are points « € G/SL(2, Z), 
such that the closure of Vz is a fractal. This means that the closure of Vx 
can be a very bad set that is not anywhere close to being a submanifold. 


Unfortunately, the known proofs of Ratner’s Orbit-Closure Theorem are 
rather long. One of the paramount ideas in the proof will be described in 
Section 20.4, but, first, we will present a few of the theorem’s applications (in 
Section 20.2) and state two other variants of the theorem (in Section 20.3). 


Exercises for §20.1. 


#1. Verify Example 20.1.1(1). 
#2. Verify Example 20.1.1(2). 
#3. Verify Example 20.1.2. 


#4. Show that if V is connected, then the subgroup L in the conclusion of 
Theorem 20.1.3 can also be taken to be connected. 


#5. (Non-divergence of unipotent flows) Suppose 
e {ut} is a unipotent one-parameter subgroup of G, and 
e te€G/T. 
Use Theorem 20.1.3 to show there is a compact subset K of G, such 
that 


{t€R| ua € K } is unbounded. 


(Hence, Theorem 7.4.6 is logically a corollary of Theorem 20.1.3. However, 
in practice, Theorem 7.4.6 is used in the proof of Theorem 20.1.3. 


[Hint: Conclusion (3) of Theorem 20.1.3 is crucial.| 


#6. Show, by providing an explicit counterexample, that the assumption 
that {u’} is unipotent cannot be eliminated in Exercise 5. 


[Hint: Consider the one-parameter group of diagonal matrices in SL(2,R), 
and let [ = SL(2, Z).] 
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§20.2. Applications 


We will briefly describe just a few of the many diverse applications of Ratner’s 
Orbit-Closure Theorem (20.1.3). 


§20.2(i). Closures of totally geodesic subspaces. 


(20.2.1) Example. Let M be a compact, 9) 
hyperbolic n-manifold (with n > 2). ee a” 
e There is a covering map 7: 9” > M 
that is a local isometry. 


e There is a natural embedding 
UH GH”. 
e Let f; be the composition 7 0 1, 
so fi: Hi > M. 
Then the image of f; is a curve in M. It is well known (though not at all 
obvious) that the closure of this curve can be a very bad set; in fact, even 
though §! and M are nice, smooth manifolds, this closure can be a fractal. 
(This is a higher-dimensional analogue of the example in Warning 20.1.5. 
In the literature, it is the fact that the closure of a geodesic in a compact 
manifold of negative curvature can be a fractal.) 
It is a consequence of Ratner’s Theorem that this pathology never occurs 
if we replace §! with a higher-dimensional hyperbolic space: 


(20.2.2) Corollary. Let: 
em, neéEN, withm<n, 
e M be a compact, hyperbolic n-manifold, 
ez: 9” —M be a covering map that is a local isometry, 
eu: 9" SH” be a totally geodesic embedding, and 
© fm=Tol, 80 fm: HN" > M. 


Ifm > 2, then the closure fm(™) of the image of fm is a (totally geodesic) 
immersed submanifold of M. 


Proof. We prove only that the closure is a submanifold, not that it is totally 
geodesic. Let 
V =SO(1,m), G=SO(1,n), and x €1(H”), 
SO 
e G acts by isometries on 9”, 
e M =I\S", for some lattice T in G, and 
e (9) =Vz. 
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From Ratner’s Orbit-Closure Theorem (20.1.3), we know there is a subgroup L 
of G, such that ['V =TL. So 
fin(N™) = 7((™)) = 1(Va) = 1(La) 


is an immersed submanifold of M. 


(20.2.3) Remarks. 


1) The same conclusion holds (with the same proof) when § and §)” are 
replaced with much more general symmetric spaces X and Y that have 
no compact factors, except that the closure may not be totally geodesic 
if rank X <rankY. 

2) When rank X = rank Y, one proves that the submanifold is totally 
geodesic by showing that the subgroup L in the above proof is invariant 
under the appropriate Cartan involution of G. 


§20.2(ii). Values of quadratic forms. Many of the most impressive 
applications of Ratner’s Orbit-Closure Theorem (and the related results that 
will be described in Section 20.3) are in Number Theory. As an example, 
we present a famous result on values of quadratic forms. It is now an easy 
corollary of Ratner’s Orbit-Closure Theorem, but, historically, it was proved 
by Margulis before this major theorem was available (by proving the relevant 
special case of the general theorem). 

Let 

Q(@) = Q(a1,72,...,£n) be a quadratic form in n variables 


(in other words, Q(Z) is a homogeneous polynomial of degree 2). 

Classical number theorists were interested in determining the values of c 
for which the equation Q(@) = ¢ has an integer solution; that is, a solution 
with & € Z”. For example: 


1) Lagrange’s 4-Squares Theorem tells us that if 
Q(#1, %2,03,04) = 2] +05 +03 + 24, 
then Q(@) =c has a solution iff c € Z2°. 
2) Fermat’s 2-Squares Theorem tells us that if Q(v1, 72) = x7 + x3, and 
p is an odd prime, then Q(@ ) = p has a solution iff p = 1 (mod 4). 


These very classical results consider only forms whose coefficients are integers, 
but we can also look at forms with irrational coefficients, such as 


OG 99, 03504) = 3x7 — V2ror3 + nx. 
For a given quadratic form Q(7 ), it is clear that 


the equation Q(z) = c does not have an integral solution, 
for most real values of c, 
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for the simple reason that there are only countably many possible integer 
values of the variables 71, 72,...,2@n, but there are uncountably many possible 
choices of c. Therefore, instead of trying to solve the equation exactly, we 
must be content with solving the equation approximately. That is, we will be 
satisfied with knowing that we can find a value of Q(@ ) that is within € of c, 
for every € (and every c). In other words, we would like to know that Q(Z”) 
is dense in R. 


(20.2.4) Examples. There are some simple reasons that Q(Z”) may fail to 
be dense in R: 


1) Suppose all of the coefficients of Q(@) are integers. Then we have 
Q(Z") C Z, so Q(Z") is obviously not dense in R. More generally, 
if Q(@) is a scalar multiple of a form with integer coefficients, then 
Q(Z”) is not dense in R. 


2) Suppose all values of Q(@ ) are > 0 (or all are < 0). (In this case, we say 


that Q(@) is positive-definite (or negative-definite, respectively). 
For example, this is the case if 
Q(@) = a,x? + aga + aged +--+ anz?, 

with all coefficients a; of the same sign. Then it is clear that Q(Z”) is 

not dense in all of R. 
3) Let Q(a1, 72) = x7 — ax}, where a = 3+2,/2. Then, although it is not 

obvious, one can show that Q(Z?) is not dense in R (see Exercise 3). 

Certain other choices of a also provide examples where Q(Z?) is not 

dense (see Exercise 2), so having only 2 variables in the quadratic form 

can cause difficulties. 


4) Even if a form has many variables, there may be a linear change of 
coordinates that turns it into a form with fewer variables. (For example, 
letting z = « + V2y transforms x? + 2/2zry + 2y? into z?.) A form 
that admits such a change of coordinates is said to be degenerate. 
Therefore, a degenerate form with more than 2 variables could merely 
be a disguised version of a form with 2 variables whose image is not 
dense in R. 


The following result shows that any quadratic form avoiding these simple 
obstructions does have values that are dense in R. It is often called the 
“Oppenheim Conjecture? because it was an open problem under that name 
for more than 50 years, but that terminology is no longer appropriate, since 
it is now a theorem. 


(20.2.5) Corollary (Margulis’ Theorem on Values of Quadratic Forms). Let 
Q(a) be a quadratic form inn > 3 variables, and assume Q(@ ) is: 
e not a scalar multiple of a form with integer coefficients, 


e neither positive-definite nor negative-definite, and 
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e nondegenerate. 
Then Q(Z") is dense in R. 


Proof. For simplicity, assume n = 3. Let 

e G=SL(3,R), and 

e H=SO(Q)° ={hEG| Qh) = Q(@) for all F € R?}°. 
Since Q(z) is nondegenerate, and neither positive-definite nor negative- 
definite, we have H © SO(1,2)° & PSL(2, R), so H is generated by unipotent 
elements. Furthermore, calculations in Lie theory (which we omit) show that 
the only connected subgroups of G containing H are the obvious ones: H 
and G. Therefore, Ratner’s Orbit-Closure Theorem (20.1.3) tells us that ei- 
ther: 

e HGz is closed, and Gz H is a lattice in H, or 

e the closure of HGz is all of G. 
However, if Hz = Gz 4H is a lattice in H, then the Borel Density Theorem 
(4.5.6) implies that H is defined over Q (see Exercise 5.145). Then, since 
H =80O(Q)°, a bit of algebra shows that Q(@ ) is a scalar multiple of a form 
with integer coefficients (see Exercise 4). This is a contradiction. 

Therefore, we conclude that the closure of HGz is all of G. In other 
words, 


HGz is dense in G, 
so 
HGz(1,0,0) is dense in G(1, 0,0). 
Since Gz(1,0,0) C Z°, and G(1,0,0) = R® \ {0}, this tells us that 
HZ? is dense in R?. 
Then, since Q(@ ) is continuous, we conclude that 
Q(HZ*) is dense in Q(R*). 


We also know: 
e Q(HZ?) = Q(Z°), by the definition of H, and 
e Q(R®) = R, because Q(@) is neither positive-definite nor negative- 
definite (see Exercise 5). 
Therefore Q(Z?) is dense in R. 


§20.2(iii). Products of lattices. 
(20.2.6) Corollary (see Exercise 6). [f [1 and I, are any two lattices in G, 
and G is simple, then either 


1) Ty and [2 are commensurable, so the product [112 is discrete, or 
2) Ty T2 is dense in G. 
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Exercises for §20.2. 


#1. 


#2. 


£3. 


4A, 


#69. 


#16. 


Suppose ( is a quadratic irrational. (This means that a is irrational, 
and that a is a root of a quadratic polynomial with integer coefficients. ) 
Show that 6 is badly approximable: i.e., there exists « > 0, such that 
if p/q is any rational number, then 


aS A > = 
q q 
Let Q(x1, 22) = 2? — 82x32, where 8 is any badly approximable number 


(cf. Exercise 1). Show that Q(Z?) is not dense in R. 
[Hint: There exists 6 > 0, such that |Q(p,q)| > 6 for p,q € Z~ {0}.] 


Let Q(x1,2%2) = x? —ax3, where a = 34+.2\/2. Show Q(Z?) is not dense 
in R. 
[Hint: Use previous exercises, and note that 3 + 2/2 = (1 + V2)" | 


Suppose Q(z) is a nondegenerate quadratic form in n variables. Show 
that if SO(Q)° is defined over Q, then Q(@) is a scalar multiple of a 
form with integer coefficients. 

[Hint: Up to scalar multiples, there is a unique quadratic form that is invari- 
ant under SO(Q)°, and the uniqueness implies that it is invariant under the 
Galois group Gal(C/Q).] 

Suppose Q(z) is a quadratic form in n variables that is neither positive- 
definite nor negative-definite. Show Q(R”) = R. 


[Hint: Q(AZ) = °Q(@).] 
Prove Corollary 20.2.6. 


(Hint: Let fT =T, x T2 C Gx G, and H = {(g,g) | g € G}. Show the only 
connected subgroups of G x G that contain H are the two obvious ones: H 
and Gx G. Therefore, Ratner’s Theorem implies that either HMT is a lattice 
in H, or TH is dense in G x G.] 


§20.3. Two measure-theoretic variants of the theorem 


Ratner’s Orbit-Closure Theorem (20.1.3) is purely topological, or qualitative. 
In some situations, it is important to have quantitative information. 


(20.3.1) Example. We mentioned earlier that if V is a line with irrational 
slope in R?, then the image 7(V) of V in T? is dense (see Example 20.1.1). 
For applications in analysis, it is often necessary to know more, namely, that 
m(V) is uniformly distributed in T?. Roughly speaking, this means that a 
long segment of 7(V) visits all parts of the torus equally often (see Exercise 1). 


(20.3.2) Definition. Let 


e yt be a probability measure on a topological space X, and 


c: [0,00) + X be a continuous curve in X. 
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We say that cis uniformly distributed in X (with respect to 1) if, for every 
continuous function f: X — R with compact support, we have 


lim tf sew) a= fi fa 


Ratner’s Orbit-Closure Theorem tells us that if 


e U is any one-parameter subgroup of G, and 
e x is any point in G/T, 
then the closure of the U-orbit Uz is a nice submanifold of G/T. The following 
theorem tells us that the U-orbit is uniformly distributed in this submanifold. 
(20.3.3) Ratner’s Equidistribution Theorem. Let 
e {u} be any one-parameter unipotent subgroup of G, 
e cE G/T, and 
e c(t) =u'z, fort € [0,). 
Then there is a connected, closed subgroup L of G, such that 
1) there is a (unique) L-invariant probability measure ys on La, 
2) the curve c is uniformly distributed in Lax, with respect to pu, 
3) the closure of { c(t) | t € [0, 00) } is Lx (so Lax is closed in G/T), and 
A) fiw CL. 
In the special case where V is a one-parameter unipotent subgroup of G, 


the following theorem is a consequence of the above Equidistribution Theorem 
(see Exercise 3). 


(20.3.4) Ratner’s Classification of Invariant Measures. Suppose 
e V is a subgroup of G that is generated by unipotent elements, and 
e 4 is any ergodic V-invariant probability measure on G/T. 


Then there is a closed subgroup L of G, and some x € G/T, such that pu is 
the unique L-invariant probability measure on La. 
Furthermore, L can be chosen so that: 


1) L has only finitely many connected components, 
2) L contains the identity component of V, and 
3) La is closed in G/T. 


Here is a sample consequence of the Measure-Classification Theorem: 


(20.3.5) Corollary. Suppose 
e u; is a nontrivial unipotent element of G;, for i = 1,2, 


e f: Gi/T1 > G2/T2 is a measurable map that intertwines the translation 
by uz with the translation by u2; that is, 


f(uiz) = u2 f(x), forae. x €G,/T1, 
and 
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e G, is connected and almost simple. 


Then f is a continuous function (a.e.). 


Proof. Let 
eG= Gy x Go, 
e T= Ty x To, 


e u= (ui, U2) € G, and 


e graph(f) = { (2, f(2)) | zé€G,/l,}cG/r. 
The projection from G/T to G,/T defines a natural one-to-one correspon- 
dence between graph(f) and G/T . In fact, since f intertwines u; with ue, 
it is easy to see that the projection provides an isomorphism between the 
action of u; on G;/T, and the action of u on graph(f). In particular, the 
u,-invariant probability measure 4, on GI; naturally corresponds to a u- 
invariant probability measure y on graph(f). 

The Moore Ergodicity Theorem (14.2.4) tells us that j11 is ergodic for uz, 
so pis ergodic for u. Hence, Ratner’s Measure-Classification Theorem (20.3.4) 
provides 


e a closed subgroup L of G, and 
e ceG/T, 
such that 
e y is the L-invariant measure on La, and 
e Lz is closed. 


Since the definition of y implies that i(graph( f )) = 1, and the choice of DL im- 
plies that the complement of Lx has measure 0, we may assume, by changing 
f on a set of measure 0, that 


graph(f) C La. 
Assume, for simplicity, that D is connected and G is simply connected. 


Then the natural projection from L to Gj is an isomorphism (see Exercise 4), 
so there is a (continuous) homomorphism p: G; — G2, such that 


L = graph(p). 
Assuming, for simplicity, that « = (e,e), this implies that f(g) = p(g)I for 
allg € G. So f, like p, is continuous. O 


As an example of the many important consequences of Ratner’s Measure- 
Classification Theorem (20.3.4), we point out that it implies the Equidistri- 
bution Theorem (20.3.3). The proof is not at all obvious, and we will not 
attempt to explain it here, but the following simple example illustrates the 
important precept that knowing all of the invariant measures can lead to an 
equidistribution theorem. 
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(20.3.6) Proposition. Let 
e {9°} be a one-parameter subgroup of G, 
e « be the G-invariant probability measure on G/T, 
e cE G/T, and 
e- c(t) =g'a. 
If 
e yu is the only g‘-invariant probability measure on G/T, and 
e G/T is compact, 
then the curve c is uniformly distributed in G/T, with respect to wu. 


Proof. Suppose c is not uniformly distributed. Then there is a sequence 
T, — co, and some continuous function fo € C(G/T), such that 


Tr 
fe : fo(c(t)) dt # u(fo) (20.3.7) 
0 


By passing to a subsequence, we may assume that 


a ae i 
Af) = jim 5 f f (c(t)) dt 
exists for every f € C(G/T) (see Exercise 5). Then: 


1) \ is a continuous linear functional on the space C(G/T) of continuous 
functions on G/T, so the Riesz Representation Theorem (B6.10) tells 
us that A is a measure on G/T. 

2) (1) = 1, so J is a probability measure. 

3) From the definition of A, it is not difficult to see that is g'-invariant 
(see Exercise 6). 


Since y is the only g‘-invariant probability measure, we must have A = wp. 
However, (20.3.7) says A( fo) 4 u(fo), so this is a contradiction. 


(20.3.8) Remark. Here is a rough outline of how the three theorems are proved: 
1) Measure-Classification is proved in the case where V is unipotent. 
e The general case of Measure-Classification follows from this. 
2) Equidistribution is a consequence of Measure-Classification. 


3) Equidistribution easily implies Orbit-Closure in the special case where 
V is a one-parameter unipotent subgroup. 
e The general case of Orbit-Closure can be deduced from this. 


(20.3.9) Remarks. 


1) Ratner’s Measure-Classification Theorem (20.3.4) remains valid if the 
lattice [ is replaced with any closed subgroup of G. However, the other 
two theorems do not remain valid in this generality. 
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2) Ratner’s Theorems assume that V is generated by unipotent elements, 


but N.Shah suggested that they might remain valid under the much 
weaker assumption that the Zariski closure of V is generated by unipo- 
tent elements. For the important case where the Zariski closure of V 
is semisimple (with no compact factors), this was recently proved by 
Y.Benoist and J.-F. Quint. 


Exercises for §20.3. 


#1. 


4.2. 


#3. 


4A, 


#69. 


Suppose pz is a probability measure on a compact metric space X. Show 
that a curve c: [0,co) + X is uniformly distributed with respect to wu 
if and only if, for every open subset O of X, such that (00 ) = 0, we 
have 


Jim alt € [0,7] | c(t) €O} = (0). 


[Hint: Bound the characteristic function of O above and below by continuous 
functions and apply Definition 20.3.2.] 


Show that if v is a nonzero vector of irrational slope in R?, then the 
curve c(t) = 7(tv) is uniformly distributed in T? (with respect to the 
usual Lebesgue measure on the torus). 


[Hint: Any continuous function on T? can be approximated by a trigonomet- 
tic polynomial 7 am,ne77"* 177" |] 


Show that if V = {u‘} is a one-parameter unipotent subgroup of G, 
then the conclusions of Theorem 20.3.4 follow from Theorem 20.3.3. 


[Hint: Pointwise Ergodic Theorem (see Exercise 14.3#8).] 


In the setting of the proof of Corollary 20.3.5, show that the projection 
L — G; is a (surjective) covering map. 


[Hint: Show LN ({e}M G2) is discrete, by using Fubini’s Theorem and the 
fact that graph(f) has nonzero measure.] 


Suppose 

e Tk > 00, 

e G/T is compact, and 

e c: (0,00) > G/T is a continuous curve. 
Show there is a subsequence 7}, — 00, such that 


exists for all f € C(G/T). 


[Hint: It suffices to consider a countable subset of C(G/T) that is dense in 
the topology of uniform convergence.| 
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#6. In the notation used in the proof of Proposition 20.3.6, show, for every 
f € C(G/T) and every s € R, that 


Tr 
A(f) = lim 7a f(g° c(t)) dt. 


k—-00 Ti, 


§20.4. Shearing — a key ingredient in the proof 


The known proofs of any of the three variants of Ratner’s Theorem are quite 
lengthy, so we will just illustrate one of the main ideas that is involved. To 
keep things simple, we will assume G = SL(2,R). 


(20.4.1) Notation. Throughout this section, 
e G=SL(2,R), 


i : F 
ews t is a one-parameter unipotent subgroup of G, 


t 
0]. , 
ea= fF a is a one-parameter diagonal subgroup of G, and 


0 
e X = SL(2,R)/T. 
The proofs of Ratner’s Theorems depend on an understanding of what 


happens to two nearby points of X as they are moved by the one-parameter 
subgroup u’. 


(20.4.2) Definition. If x and y are any two points of G/T, then there exists 
q € G, such that y = qu. If x is close to y (which we denote x & y), then 
q may be chosen close to the identity. Therefore, we may define a metric d 


on G/T by 
a(e,y) = min { I-10 | ae 
qt =Y 


where 


e Id is the identity matrix, and 


e ||- || is any (fixed) matrix norm on Mat2x2(R). For example, one may 


a b 
ed 
(Actually, this definition does not guarantee d(x, y) = d(y, x), so it may 
not define a metric, but let us ignore this minor issue.) 


= max{|aj, |b], |c|, |d 


Now, we consider two points x and qx, with gq = Id, and we wish to 
calculate d(u‘zx, u'gqx) (see Figure 20.4A). 


e To get from x to qx, one multiplies by q; therefore 
d(x, qx) = |\q—Id]]. 
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Figure 20.4A. The u‘-orbits of two nearby orbits. 


e To get from u’x to u’gx, one multiplies by uéqu—‘; therefore 
d(u'x,u'qr) = |ju’'qu—* — Id ||. 
(Actually, this equation only holds when the right-hand side is small 
— there are infinitely many elements g of G with gu'x = u'qx, and the 
distance is obtained by choosing the smallest one, which may not be 
u'qu* if t is large.) 


Letting 
a b 
q—Id= E i ; 
a simple matrix calculation shows that 
a ae a — bt b 
BME aN . +(a—d)t—bt? d+ ‘| pee) 


(20.4.4) Notation. For convenience, let 7, = u'x and y, = uy. 


Consider the right-hand side of Equation (20.4.3), with a, b, c, and d 
very small. Indeed, let us say they are infinitesimal (too small to see). As 
t grows, it is the quadratic term in the bottom left corner that will be the first 
matrix entry to attain macroscopic size. Comparing with the definition of u* 
(see Notation 20.4.1), we see that this is exactly the direction of the u!-orbit. 
Therefore: 


(20.4.5) Proposition (Shearing Property). The fastest relative motion be- 
tween two nearby points is parallel to the orbits of the flow. 


Yt 


x Xt 


Figure 20.4B. Shearing: If two points start out so close 
together that we cannot tell them apart, then the first dif- 
ference we see will be that one gets ahead of the other, but 
(apparently) following the same path. It is only much later 
that we will be able to detect any difference between their 
paths. 


20.4. SHEARING — A KEY INGREDIENT IN THE PROOF 419 


(20.4.6) Remarks. 

1) The only exception to Proposition 20.4.5 is that if q is in the centralizer 
Ca(u"), then u'qu~* = q for all t; in this case, the points 2; and y 
simply move along together at exactly the same speed, with no relative 
motion. 

2) In contrast to the above discussion of wu’, 

e the matrix a’ is diagonal, but 


e the largest entry in 
Qt 
tt __ a e~"b 
aqa = lm d | 


is an off-diagonal entry, 
so, under the action of the diagonal group, points move apart (at expo- 
nential speed) in a direction transverse to the orbits (see Figure 20.4C). 


Yt 


x 


Xt 


Figure 20.4C. Divergence under a diagonal subgroup: 
when two points start out so close together that we can- 
not tell them apart, the first difference we see will be in a 
direction transverse to the orbits. 


The Shearing Property (20.4.5) shows that the direction of fastest relative 
motion is along u’. However, in the proof of Ratner’s Theorems, it turns out 
that we wish to ignore motion along the orbits, and consider, instead, only 
the component of the relative motion that is transverse (or perpendicular) 
to the orbits of u’. This direction, by definition, does not belong to {u‘}. 


(20.4.7) Definition. Suppose, as before, that x and y are two points in X with 
x = y. Then, by continuity, 7; + y; for a long time. Eventually, we will be able 
to see a difference between x; and y;. The Shearing Property (20.4.5) tells us 
that, when this first happens, y; will be indistinguishable from some point on 
the orbit of x; that is, y, & x» for some t’. This will continue for another long 
time (with ¢t’ some function of t), but we can expect that y; will eventually 
diverge from the orbit of « — this is transverse divergence. (Note that 
this transverse divergence is a second-order effect; it is only apparent after 
we mod out the relative motion along the orbit.) Letting xy be the point on 
the orbit of x that is closest to yz, we write y = gx for some g € G. Then 
g — 1d represents the transverse divergence. When this transverse divergence 
first becomes macroscopic, we wish to understand which of the matrix entries 
of g — Id are macroscopic. 
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In the matrix on the right-hand side of Equation (20.4.3), we have al- 
ready observed that the largest entry is in the bottom left corner, the di- 
rection of {u’}. If we ignore that entry, then the two diagonal entries are 
the largest of the remaining entries. The diagonal corresponds to the sub- 
group {a'}. Therefore, the fastest transverse divergence is in the direction 
of {a‘}. Notice that {a‘} normalizes {u'}. 


(20.4.8) Proposition. The fastest transverse motion is along some direction 
in the normalizer of u‘. 

More precisely, if x,y € X, witha » y, andr > 0 is much smaller than 
the injectivity radius of X, then either: 


1) there exist large t,t! € R and g € Ng({u'}) such that 
u'y = gu’ x and |\g\| = d(g, {u"}) =r, 
or 


2) for allt ER, there exists t' € R, such that uty © ul x (i.e., there is no 
transverse motion, only shearing). 


To illustrate how understanding the transverse motion can be useful, let 
us prove a very special case of Ratner’s Orbit-Closure Theorem (20.1.3). 


(20.4.9) Proposition. Let C = {u'}x, for some x € X, and assume 


e C is a minimal u!-invariant closed subset of X (this means that no 
nonempty, proper, closed subset of C is {u'}-invariant), and 


e{gEG|gC=Ch= {u'}. 
Then C = {u'}x, so C is a submanifold of X. 
Proof. We wish to show C C {u‘}zx, but Exercise 1 implies that it suffices to 
prove only the weaker statement that CC Ng ({ué}) x. 

Suppose C' Z Na ({u'}) x. Then, since C' is connected, there exists y € C, 
with y © x, but y ¢ Nc({u'}) x. From Proposition 20.4.8, we see that there 
exist t,t’ € R and 

9g € Na({u'}), with g ¢ {u"}, such that u’y = gue x. 
For simplicity, let us pretend that 
u'y is equal to gu’ x, 
rather than merely being approximately equal (see Exercise 2). Then we 


have CN gC ¢ @ (because uty € C and gu’ x € gC). This contradicts 
Exercise 1. | 


Exercises for §20.4. 


#1. Under the assumptions of Proposition 20.4.9, show: 
if 9 € Na({u'}), but g ¢ {u*}, then CN gC = 90. 
(In particular, gx ¢ C.) 
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[Hint: gC is u‘-invariant (because g € Ne({u"})), so CN gC is a u’-invariant 
subset of the minimal set C.] 


#2. Complete the proof of Proposition 20.4.9 by eliminating the pretense 
that uty is equal to gu’ x. 
[Hint: The compact sets C and {g € Na({u'}) | |lgl] = d(g, {u}) =r}-C 


are disjoint, so it is impossible for a point in one set to be arbitrarily close to 
a point in the other set. 


Notes 


Theorem 20.1.3 is due to M. Ratner [13], under the assumption that V is 
either unipotent or connected. (A shorter proof can be found in [7].) This 
additional hypothesis was removed by N.Shah [15] (except for a technical 
problem involving Conclusions (2) and (3) that was resolved in [8, Cor. 3.5.4]). 

See [9] for a more thorough introduction to Ratner’s Theorems, their 
proofs, and some applications. 

See [16, Lem. 2] for the construction of orbits whose closure is not a sub- 
manifold, demonstrating the pathology in Warning 20.1.5 and Example 20.2.1. 
However, it was conjectured by G. A. Margulis [6, $1.1] that certain analogues 
of Ratner’s Theorems are valid in some situations where the subgroup V is a 
split torus of dimension > 1; see [3, §4.4c] and [8] for references on this open 
problem and its applications. 

Corollary 20.2.2 was proved by N.Shah [14]. The generalization in Re- 
mark 20.2.3(1) is due to T. Payne [10]. 

Corollary 20.2.5 is due to G. A. Margulis [4]. See [5] for a survey of its 
history and later related developments. 

Corollary 20.2.6 was discovered by N.Shah [15, Cor. 1.5]. This conse- 
quence of Ratner’s Theorem played an important role in [17]. 

Theorem 20.3.3 is due to M. Ratner [13]. 

Theorem 20.3.4 was proved by M. Ratner [12] in the case where V is either 
unipotent or connected. (See [2] for a shorter and more self-contained proof 
in the case where V = SL(2,R).) The general case is due to N. Shah [15]. 

Corollary 20.3.5 was proved by M. Ratner [11] if G1 = Ga = SL(2,R). 
The general case is due to D. Witte [19]. 

Proposition 20.3.6 is a special case of a classical result in Ergodic Theory 
that can be found in textbooks such as [18, Thm. 6.19]. 

See [1] for the work of Y.Benoist and J._F.Quint mentioned in Re- 
mark 20.3.9(2). Shah’s suggestion about Zariski closures appears in [15, end 
of §1, p. 232]. 

The discussion of shearing in Section 20.4 is excerpted from [9, §1.5], 
except that Proposition 20.4.9 is a variant of [9, Prop. 1.6.10]. 
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Appendices 


Appendix A 


Basic Facts about Semisimple Lie Groups 


§8A1. Definitions 


We are interested in groups of matrices that are (topologically) closed: 


(Al.1) Definitions. 


1) Let Mate,<¢(R) be the set of all 2 x @ matrices with real entries. This 
has a natural topology, obtained by identifying it with the Euclidean 


2 
space R° . 


2) Let SL(€,R) = {g € Mate,e(R) | detg = 1}. This is a group un- 
der matrix multiplication (see Exercise 1), and it is a closed subset of 


Mate,¢(IR) (see Exercise 2). 


3) A Lie group is any (topologically) closed subgroup of some SL(¢, R). 


Recall that an abstract group is sample if it has no nontrivial, proper, 
normal subgroups. For Lie groups, we relax this to allow normal subgroups 
that are discrete (except that the one-dimensional abelian groups R and T 


are not considered to be simple). 


(A1.2) Definition. A Lie group G is simple if it has no nontrivial, connected, 
closed, proper, normal subgroups, and G is not abelian. 


(A1.3) Example. It can be shown that G = SL/(Z, 


R) is a simple Lie group 


(when @ > 1). If is even, then {+ Id} is a subgroup of G, and it is normal, 
but, because this subgroup is not connected, it does not disqualify G from 


being simple as a Lie group. 


Recall: The Standing Assumptions (4.0.0 on page 


43) are in effect, so, as 


always, Tis a lattice in the semisimple Lie group G C SL(£, R). 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 
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(Al.4) Remark. Although Definition Al.2 only refers to closed normal sub- 
groups, it turns out that, except for the center, there are no normal subgroups 
at all: if G is simple, then every proper, normal subgroup of G is contained 
in the center of G. 


(A1.5) Other terminology. Some authors say that SL(n, R) is almost sim- 
ple, and reserve the term “simple” for groups that have no (closed) normal 
subgroups at all, not even finite ones. 


A Lie group is said to be semisimple if it is a direct product of simple 
groups, modulo passing to a finite-index subgroup and/or modding out a finite 


group: 
(A1.6) Definitions. 


1) G, is isogenous to G2 if there is a finite, normal subgroup JN; of a finite- 
index subgroup G‘ of G;, for 7 = 1,2, such that G/N; is isomorphic 


to G/N. 

2) G is semisimple if it is isogenous to a direct product of simple Lie 
groups. That is, G is isogenous to G; x --: x G,, where each G; is 
simple. 


(Al.7) Example. SL(2,R) x SL(3, R) is a semisimple Lie group that is not 
simple (because SL(2, R) and SL(3,R) are normal subgroups). 


(A1.8) Remark (see Exercise A4#9). If G is semisimple, then the center of G 
is finite. 


(A1.9) Assumption (cf. the Standing Assumptions (4.0.0)). Now that we 
have the definition of a semisimple group, we will henceforth assume in this 
chapter that the symbol G always denotes a semisimple Lie group with only 
finitely many connected components (but the symbol T° will never appear). 


(A1.10) Warning. A Lie group is usually defined to be any group that is 
also a smooth manifold, such that the group operations are C'° functions. 
Proposition A6.2(1) below shows that every closed subgroup of SL(@,R) is a 
Lie group in the usual sense. However, the converse is false: not every Lie 
group (in the usual sense) can be realized as a subgroup of some SL(¢, R). (In 
other words, not every Lie group is linear.) Therefore, our Definition A1.1(3) 
is more restrictive than the usual definition. (However, every connected Lie 
group is “locally isomorphic” to a linear Lie group.) 


Exercises for §A1. 


#1. Show that SL(¢,R) is a group under matrix multiplication. 


[Hint: You may assume (without proof) basic facts of linear algebra, such 
as the fact that a square matrix is invertible if and only if its determinant is 
not 0.] 


#2. 


#3. 


4A, 


45. 


#6. 
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Show that SL(@, R) is a closed subset of Mate x¢(R). 


[Hint: For a continuous function, the inverse image of a closed set is closed.] 


Recall that GL(é@,R) = {g € Matey¢(R) | detg 4 0}, and that this 
is a group under matrix multiplication. Show that it is (isomorphic 
to) a Lie group, by showing it is isomorphic to a closed subgroup of 
SL(€+ 1,R). 

Suppose G = G; x --- X G,, where each G; is simple, and N is a 
connected, closed, normal subgroup of G. Show there is a subset S of 
{1,...,r}, such that N = [],<9 Gi. 


[Hint: If the projection of N to G; is all of G;, then G; = [Gi, Gi] = [N, Gi] C 
N|] 


Show that if G is semisimple, and N is any closed, normal subgroup 
of G, then G/N is semisimple. 
[Hint: Exercise 4.] 


Suppose N is a connected, closed, normal subgroup of G. Show that 
there is a connected, closed, normal subgroup H of G, such that G is 
isogenous to N x H. 

[Hint: Exercise 4.] 


§A2. The simple Lie groups 


It is clear from Definition A1.6(2) that the study of semisimple groups requires 
a good understanding of the simple groups. Probably the most elementary 
examples of simple Lie groups are special linear groups and orthogonal groups, 
but symplectic groups and unitary groups are also fundamental. A group of 
any of these types is called “classical”? (The other simple groups are “excep- 
tional? and are less easy to construct.) 


(A2.1) Definition. G is a classical group if it is isogenous to the direct 
product of any collection of the groups constructed in Examples A2.3 and A2.4 
below. That is, each simple factor of G is either a special linear group or the 
isometry group of a bilinear, Hermitian, or skew-Hermitian form, over R, C, 


or H 


(where H is the algebra of quaternions). 


(A2.2) Notation. Let 
e g' denote the transpose of the matrix g, 


e g* denote the adjoint (that is, the conjugate-transpose) of g, 


e G° denote the identity component of the Lie group G, and 
© linn = diag(1,1,...,1,-1,-1,...,-l) € Mat (m4n)x(m+n)(R), 


where the number of 1’s is m, and the number of —1’s is n. 


(A2.3) Example. 
1) The special linear group SL(n,R) is a simple Lie group (if n > 2). 
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2) 


3) 


4) 
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Special orthogonal group. Let 
SO(m,n) = {g € SL(m+n,R) | 97 Imng =Imon}- 
This is always semisimple (if m+n > 3). It may not be connected, 
but the identity component SO(m,n)° is simple if either m + n = 3 or 
m+n> 5. (Furthermore, the index of SO(m,n)° in SO(m,n) is < 2.) 
We use SO(n) to denote SO(n,0) (or SO(0,n), which is the same 
group). 
Special unitary group. Let 
SUG A= 19 = SL, OC) Tae = ia 
Then SU(m, 7) is simple if m+n > 2. 
We use SU(n) to denote SU(n,0) (or SU(0,7)). 
Symplectic group. Let 
a 0 Tdiniscen 
Jom = @ ieee: 0 ) € GL(2m, R) 
(where Idjxm denotes the m x m identity matrix), and let 
Sp(2m, R) = { g € SL(2m, R) | g? Jom g = Jam }- 
Then Sp(2m, R) is simple if m > 1. 


(A2.4) Example. Additional simple groups can be constructed by replacing 
the field R with either the field C of complex numbers or the division ring H 
of quaternions: 


1) 


Complex and quaternionic special linear groups: SL(n,C) and 
SL(n, H) are simple Lie groups (if n > 2). 

Note: The noncommutativity of H causes some difficulty in defining 
the determinant of a quaternionic matrix. To avoid this problem, we 
define the reduced norm of a quaternionic n x n matrix g to be the 
determinant of the 2n x 2n complex matrix obtained by identifying 
H” with C?". Then, by definition, g belongs to SL(n,H) if and only 
if its reduced norm is 1. It is not difficult to see that the reduced 
norm of a quaternionic matrix is always a (nonnegative) real number 
(see Exercise 1). 


Complex and quaternionic special orthogonal groups: 
SO(n, C) = {g € SL(n,C) | g? Id g = Id} 


and 


SO(n, H) = {g € SL(n, H) | 7-(g") Id g = Id}, 
where 7, is the reversion on H defined by 
Tr(ap + ayi + aaj + a3k) = ag + aii — oj + agk. 
(Note that 7,(ab) = 7,(b)7,(a) (see Exercise 2); 7, is included in the 


definition of SO(n, H) in order to compensate for the noncommutativity 
of H (see Exercise 3).) 
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3) Complex symplectic group: Let 
Sp(2m,C) = {g ¢ SL(2m,C) | g7 Jam g = Jom }- 
4) Symplectic unitary groups: Let 
Sp(m,n) ={g €SL(m+n,H) | 9*lmng = Imn }- 


Here, as usual, g* denotes the conjugate-transpose of g; recall that the 
conjugate of a quaternion is defined by 


at+t&+ce+dk=a—bvi-—cj —dk 
(and that Ty = YZ). We use Sp(n) to denote Sp(n, 0) (or Sp(0,7n)). 


(A2.5) Other terminology. Some authors use 
e SU*(2n) to denote SL(n, H), 
e SO*(2n) to denote SO(n, H), or 
e Sp(n, R) to denote Sp(2n, R). 


(A2.6) Remark. SL(2,R) is the smallest connected, noncompact, simple Lie 
group; it is contained (up to isogeny) in any other. For example: 
1) If SL(n, R), SL(n, C), or SL(n, H) is not compact, then n > 2, so the 
group contains SL(2, R). 


2) If SO(m,n) is semisimple and not compact, then min{m,n} > 1 and 
max{m,n} > 2, so it contains SO(1, 2), which is isogenous to SL(2, R). 

3) If SU(m,n) or Sp(m,n) is not compact, then min{m,n} > 1, so the 
group contains SU(1,1), which is isogenous to SL(2, R). 

4) Sp(2m,R) and Sp(2m,C) both contain Sp(2,R), which is equal to 
SL(2, R). 

5) If SO(n, C) is semisimple and not compact, then n > 3, so the group 
contains SO(1, 2), which is isogenous to SL(2, R). 


6) If SO(n, H) is not compact, then n > 2, so it contains a subgroup 
conjugate to SU(1, 1), which is isogenous to SL(2, R). 


The classical groups are just examples, so one would expect there to be 
many other (more exotic) simple Lie groups. Amazingly, that is not the case 
— there are only finitely many others: 

(A2.7) Theorem (E. Cartan). Every simple Lie group is isogenous to either 
1) a classical group, or 


2) one of the finitely many exceptional groups. 


See Sections 18.1 and 18.3 for an indication of the proof of Theorem A2.7. 
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Exercises for §A2. 


#1. For all nonzero g € Matnxn(H), show that the reduced norm of g is a 
nonnegative real number. 


[Hint: Use row and column operations in Mat; xn(H) to reduce to the case 
where g is upper triangular. For n = 1, the reduced norm of g is g @.| 


#2. In the notation of Example A2.4, show that 7,.(ab) = 7,(b) 7,(a) for all 
a,b € H. 


[Hint: Calculate explicitly, or note that 7,(x) = 7 #j~' (and zy = yZ).] 
#3. For g,h € Matnxn(H), show that 7,((gh)?) = 7,(h7) 7-(g7). 
#4. Show that SO(n, H) is a subgroup of SL( 


n,H). 


§A3. Haar measure 


Standard texts on real analysis construct a translation-invariant measure 
on R”. this is called Lebesgue measure, but the analogue for other Lie 
groups is called “Haar measure:” 


(A3.1) Proposition (Existence and Uniqueness of Haar Measure). If H is 
any Lie group, then there is a unique (up to a scalar multiple) o-finite Borel 
measure jt on H, such that 


1) p(C) is finite, for every compact subset C of H, and 
2) u(hA) = p(A), for every Borel subset A of H, and everyhe H. 


(A3.2) Definitions. 


1) The measure yp of Proposition A3.1 is called the left Haar measure 
on H. Analogously, there is a unique right Haar measure with 
p(Ah) = (A) (see Exercise 2). 


2) H is unimodular if the left Haar measure is also a right Haar measure. 
(This means p(hA) = u(Ah) = p(A).) 


(A3.3) Remark. Haar measure is always inner regular: (A) is the supre- 
mum of the measures of the compact subsets of A. 


(A3.4) Proposition. There is a continuous homomorphism A: H > Rt, 
such that, if is any (left or right) Haar measure on H, then 


u(hAh—') = A(h) p(A), for all h € H and any Borel set A C H. 


Proof. Let yz be a left Haar measure. For each h € H, define ¢,: H — H by 
én(x) = hrh~*. Then ¢;, is an automorphism of H, so (dp),4,1 is a left Haar 
measure. By uniqueness, we conclude that there exists A(h) € R*, such that 
(in) = A(h) pw. It is easy to see that A is a continuous homomorphism. By 
using the construction of right Haar measure in Exercise 2, it is easy to verify 
that the same formula also applies to it. L] 
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(A3.5) Definition. The function A defined in Proposition A3.4 is called the 
modular function of H. 


(A3.6) Other terminology. Some authors call 1/A the modular function, 
because they use the conjugation h~!Ah, instead of hAh™!. 


(A3.7) Corollary. Let A be the modular function of H, and let A be a Borel 
subset of H. 
1) If uw is a right Haar measure on H, then u(hA) = A(h) u(A), for all 
he di. 
2) If uw is a left Haar measure on G, then (Ah) = A(h') p(A), for all 
he di. 
3) H is unimodular if and only if A(h) = 1, for allh € H. 
4) A(h) =|det(Ady h)| for allh € H (see Notation A6.17). 


(A3.8) Remark. G is unimodular, because semisimple groups have no non- 
trivial (continuous) homomorphisms to Rt (see Exercise 3). 


(A3.9) Proposition. Let : be a left Haar measure on a Lie group H. Then 
L(H) < oo if and only if H is compact. 


Proof. (<=) See Proposition A3.1(1). 

(=) Since u(H) < oo (and the measure yz is inner regular), there is a 
compact subset C' of H, such that u(C) > w(H)/2. Then, for any h € H, we 
have 

w(hC) + u(C) = w(C) + W(C) = 2n(C) > u(A), 

so hC’ cannot be disjoint from C’.. This implies that h belongs to the set 
C.-C, which is compact. Since h is an arbitrary element of H, we conclude 
that H = C-C™! is compact. 


Exercises for §A3. 


#1. Prove the existence (but not uniqueness) of Haar measure on H, with- 
out using Proposition A3.1, under the additional assumption that the 
Lie group H is a C® submanifold of SL(é, R) (cf. Proposition A6.2(1)). 


[Hint: For k = dim H, there is a differential k-form on H that is invariant 
under left translations. | 


#2. Suppose pu is a left Haar measure on H, and define fi(A) = p(A7?). 
Show j is a right Haar measure. 


#3. Assume G is connected. Show that if ¢: G — A is a continuous homo- 
morphism, and A is abelian, then ¢ is trivial. 
[Hint: The kernel of a continuous homomorphism is a closed, normal sub- 
group. | 
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§A4. G is almost Zariski closed 


(A4.1) Definitions. 


1) 


2) 


We use R[11,...,%¢,| to denote the set of real polynomials in the ? 
variables { x;,; | 1 <i,j < @}. 
For any Q € Rivi1,..., ee], and any g € Matexe(C), we use Q(g) to 
denote the value obtained by substituting the matrix entries g;,; into 
the variables x;,;. For example, if Q = %1,1%2,2 — £1,2%2,1, then Q(q) is 
the determinant of the first principal 2 x 2 minor of g. 
For any subset Q of R[a11,...,x¢,¢], let 

Var(Q) = {g € SL(Z,R) | Q(g) =0, VQ € Q}. 
This is the variety associated to Q. 
A subset H of SL(@,R) is Zariski closed if there exists a subset Q of 
R[{r11,.--,2e~], such that H = Var(Q). (In the special case where H 
is a subgroup of SL(é,R), we may also say that H is a real algebraic 
group or an algebraic group that is defined over R.) 
The Zariski closure of a subset H of SL(¢, R) is the (unique) smallest 
Zariski closed subset of SL(¢@,R) that contains H. This is sometimes 
denoted H. (It can also be denoted H, if this will not lead to confusion 
with the closure of H in the ordinary topology.) 


(A4.2) Example. 


1) 
2) 


3) 


5) 


SL(@,R) is Zariski closed. Let Q = @). 

The group of diagonal matrices in SL(@,R) is Zariski closed. Let Q = 
{xij [tA J}. 

For any A € GL(¢,R), the centralizer of A is Zariski closed. Let 


L 
— { Deere ~Ajntej)|1Sij<e 
k=1 


If we identify SL(n,C) with a subgroup of SL(2n, R), by identifying C 
with R?, then SL(n,C) is Zariski closed, because it is the centralizer 
of T;, the linear transformation in GL(2n,R) that corresponds to scalar 
multiplication by 2. 


The classical groups of Examples A2.3 and A2.4 are Zariski closed (if 
we identify C with R? and H with R* where necessary). 


(A4.3) Other terminology. 
e Other authors use GL(¢,R) in the definition of Var(Q), instead of 


SL(¢,R). Our choice leads to no loss of generality, and simplifies the 
theory slightly. (In the GL theory, one should, for technical reasons, 
stipulate that the function 1/det(g) is considered to be a polynomial. 
In our setting, det g is the constant function 1, so this is not an issue.) 
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e What we call Var(Q) is actually only the real points of the variety. 
Algebraic geometers usually consider the solutions in C, rather than R, 
but our preoccupation with real Lie groups leads to our emphasis on 
real points. 


(A4.4) Example. Let 


e 0 00 
0 et 0 0 

H=)|15 “9 1 4||teR¢ CSEGR)- 
0 0 01 


Then H is a 1-dimensional subgroup that is not Zariski closed. Its Zariski 
closure is 


Cc SL(4,R). 


teER 


a 0 00 
0 1/a 0 0} | aeR\ {0}, 
0 0 1t 

0 


The point here is that the exponential function is transcendental, not polyno- 
mial, so no polynomial can capture the relation that ties the diagonal entries 
to the off-diagonal entry in H. Therefore, as far as polynomials are concerned, 
the diagonal entries in the upper left are independent of the off-diagonal entry, 
as we see in the Zariski closure. 


(A4.5) Remark. If H is Zariski closed, then the set Q of Definition A4.1 can 
be chosen to be finite (because the ring R{r11,..., 22] is Noetherian). 


Everyone knows that a (nonzero) polynomial in one variable has only 
finitely many roots. The following important fact generalizes this observation 
to any collection of polynomials in any number of variables. 


(A4.6) Theorem. Every Zariski closed subset of SL(¢,R) has only finitely 
many connected components. 


(A4.7) Definition. A closed subgroup H of SL(¢,R) is almost Zariski closed 
if it has only finitely many components, and there is a Zariski closed sub- 
group H, of SL(@,R), such that H° = H?. In other words, in the terminology 
of Definition 4.2.1, H is conmmensurable to a Zariski closed subgroup. 


(A4.8) Examples. 


1) Let H be the group of diagonal matrices in SL(2, R). Then H is Zariski 
closed (see Example A4.2(2)), but H° is not: any polynomial that 
vanishes on the diagonal matrices with positive entries will also vanish 
on the diagonal matrices with negative entries. So H° is almost Zariski 
closed, but it is not Zariski closed. 


2) Let G = SO(1,2)°. Then G is almost Zariski closed (because SO(1, 2) 
is Zariski closed), but G is not Zariski closed (see Exercise 1). 
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These examples are typical: a connected Lie group is almost Zariski closed if 
and only if it is the identity component of a group that is Zariski closed. 


The following fact gives the Zariski closure a central role in the study of 
semisimple Lie groups. 


(A4.9) Theorem. Jf GC SL(¢,R), then G is almost Zariski closed. 


Proof. Let G be the Zariski closure of G. Then G is semisimple. (For exam- 
ple, if G is irreducible in SL(¢, C), then G is also irreducible, so Corollary A7.7 
below implies that G is semisimple. ) 

Since G has only finitely many connected components (see Assump- 
tion A1.9), we may assume, by passing to a subgroup of finite index, that 
it is connected. This implies that the normalizer Ngpic,r)(G) is Zariski closed 
(see Exercise 2). Therefore G is contained in the normalizer, which means 
that G is a normal subgroup of G 

Hence (up to isogeny), we have G=GxH , for some closed, normal 
subgroup H of re (see Exercise Al#6). So G = Ca(H)° is almost Zariski 
closed (see Example A4.2(3)). O 


(A4.10) Warning. Theorem A4.9 relies on our standing assumption that G 
is semisimple (see Example A4.4). (Actually, it suffices to know that, besides 
being connected, G is perfect; that is, G = [G, G] is equal to its commutator 
subgroup. ) 


Exercises for §A4. 


#1. Show that SO(1, 2)° is not Zariski closed. 
[Hint: We have 


rh a eee! 
s{s-i sti o]esoa2° e s>0. 
0 o' 


If a rational function f: R \ {0} — R vanishes on R*, then it also vanishes 
on R- || 

#2. Show that if H is a connected Lie subgroup of SL(¢, R), then the nor- 
malizer Nsi(e,r)(H) is Zariski closed. 


[Hint: g © N(H) if and only if ghg~' = h, where  C Mate xe(R) is the Lie 
algebra of H.| 


#3. Show that if Hf is the Zariski closure of a subgroup H of G, then gHg7! 
is the Zariski closure of gHg~', for any g € G. 


#4. Suppose G is a connected subgroup of SL(¢,R) that is almost Zariski 
closed, and that QO C Riw11,..., re¢]. 
a) Show that GM Var(Q) is a closed subset of G. 


#5. 


#6. 


HT. 


HE, 
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b) Show that if G Z Var(Q), then GNM Var(Q) does not contain any 
nonempty open subset of G. 

c) Show that if G Z Var(Q), then GN Var(Q) has measure zero, with 
respect to the Haar measure on G. 


[Hint: For (b) and (c), you may assume, without proof, that, for some d, 
there exist 


= Var(Q_1) C Var(Qo) © Var(Q1) C--- C Var(Qa) = Var(Q), 


such that GM (Var(Qzx) \ Var(Qx-1)) is a (possibly empty) k-dimensional 
C® submanifold of G, for 0 < k < d. (GN Var(Qx-1) is called the singular 
set of the variety GM Var(Q;).)| 


Show, for any subspace V of R*, that 
Stabsrcer)(V) = {g € SL(¢,R) | gV =V } 


is Zariski closed. 


A Zariski-closed subset of SL(¢, R) is irreducible if it cannot be writ- 
ten as the union of two Zariski-closed, proper subsets. Show that every 
Zariski-closed subset A of SL(¢,IR) has a unique decomposition as an 
irredundant, finite union of irreducible, Zariski-closed subsets. (By ir- 
redundant, we mean that no one of the sets is contained in the union 
of the others.) 

[Hint: The ascending chain condition on ideals of R[x1,1,..., x¢,2| implies the 
descending chain condition on Zariski-closed subsets, so A can be written as 


a finite union of irreducibles. To make the union irredundant, the irreducible 
subsets must be maximal. 


Let H be a connected subgroup of SL(@,R). Show that if H C A; U Aa, 
where A; and Ag are Zariski-closed subsets of SL(é,R), then either 
A CG Aj or H G Ao. 


[Hint: The Zariski closure H = B, U---U B; is an irredundant union of 
irreducible, Zariski-closed subsets (see Exercise 6). For h € H, we have 


H =hB, U---UhB,, so uniqueness implies that h acts as a permutation of 
{B;}. Because H is connected, conclude that H = B, is irreducible.| 


Assume G' is connected, and G C SL(@,R). Show there exist 

e a finite-dimensional real vector space V, 

e a vector v in V, and 

e a continuous homomorphism p: SL(é,R) > SL(V), 
such that G = Stabgz,¢,r)(v)°. 
[Hint: Let V, be the vector space of polynomial functions on SL(¢,R), and 
let Wy, be the subspace consisting of polynomials that vanish on G. Then 
SL(Z, R) acts on V, by translation, and W,, is G-invariant. For n sufficiently 
large, W,, contains generators of the ideal of all polynomials vanishing on G, 
so G = Stabsre,z)(Wn)°. Now let V be the exterior power A Vn, where 
d= dim W,, and let v be a nonzero vector in A“ Wy] 
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#9. Show that the center of G is finite. 


[Hint: The identity component of the Zariski closure of Z(G) is a connected, 
normal subgroup of G.| 


§A5. Three useful theorems 


§A5(i). Real Jordan decomposition. 


(A5.1) Definition. Let g © GL(n,R). We say that g is 
1) semisimple if g is diagonalizable (over C), 
2) hyperbolic if 
e g is semisimple, and 
e every eigenvalue of g is real and positive, 
3) elliptic if 
e g is semisimple, and 
e every eigenvalue of g is on the unit circle in C, 


4) unipotent (or parabolic) if 1 is the only eigenvalue of g over C. 


(A5.2) Remark. A matrix g is semisimple if and only if the minimal polyno- 
mial of g has no repeated factors. 


1) Because its eigenvalues are real, any hyperbolic g element is diagonal- 
izable over R. That is, there is some h € GL(@,R), such that h~'gh is 
a diagonal matrix. 


2) An element is elliptic if and only if it is contained in some compact 
subgroup of GL(¢,R). In particular, if g has finite order (that is, if 
g” =I1d for some n > 0), then g is elliptic. 


3) A matrix g € GL(¢,R) is unipotent if and only if the characteristic 
polynomial of g is (x — 1)‘. (That is, 1 is the only root of the charac- 
teristic polynomial, with multiplicity @.) Another way of saying this is 
that g is unipotent if and only if g —Id is nilpotent (that is, if and only 
if (g — Id)” = 0 for some n € N). 


(A5.3) Remark. Remark A2.6 implies that if G is not compact, then it con- 
tains nontrivial hyperbolic elements, nontrivial elliptic elements, and nontriv- 
ial unipotent elements. 


(A5.4) Proposition (Real Jordan Decomposition). Any element g of G can 
be written uniquely as the product g = aku of three commuting elements 
a,k,u of G, such that a is hyperbolic, k is elliptic, and u is unipotent. 
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§A5(ii). Engel’s Theorem on unipotent subgroups. 


(A5.5) Definition. A subgroup U of SL(@,R) is said to be unipotent if all 
of its elements are unipotent. 


(A5.6) Example. Let N be the group of upper-triangular matrices with 1’s 
on the diagonal; that is, 


i; 
1 * 
N= C SL(é,R). 


It is obvious that N is unipotent. 


Therefore, it is obvious that every subgroup of N is unipotent. Con- 
versely: 


(A5.7) Theorem (Engel’s Theorem). Every unipotent subgroup of SL(Z,R) 
is conjugate to a subgroup of the group N of Example A6.6. 
§A5(iii). Jacobson-Morosov Lemma. 


(A5.8) Theorem (Jacobson-Morosov Lemma). For every unipotent element u 
of G, there is a subgroup H of G isogenous to SL(2,R), such that u € H. 


Exercises for §A5. 


#1. Show that an element of SL(@,R) is unipotent if and only if it is conju- 
gate to an element of the subgroup N of Example A5.6. 


[Hint: If g is unipotent, then all of its eigenvalues are real, so it can be 
triangularized over R.| 


#2. Show that the Zariski closure of every unipotent subgroup is unipotent. 


§A6. The Lie algebra of a Lie group 


(A6.1) Definition. A map p from one Lie group to another is a homomor- 
phism if 
e it is a homomorphism of abstract groups (i.e., p(ab) = p(a) p(b)), and 
e it is continuous. 
(Hence, an isomorphism of Lie groups is a continuous isomorphism of ab- 
stract groups, whose inverse is also continuous. ) 


Although the definition only requires homomorphisms to be continuous, 
it turns out that they are always infinitely differentiable: 


(A6.2) Proposition. Suppose H; and Ho are closed subgroups of GL(é;, R), 
fort=1,2. Then 
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1) H; is aC™ submanifold of GL(¢;,R), 
2) every (continuous) homomorphism from H, to Hz is C%° and 
3) if Hy C Ho, then the coset space H2/H, is a CC® manifold. 


(A6.3) Remark. In fact, the submanifolds and homomorphisms are real ana- 
lytic, not just C°°, but we will have no need for this stronger statement. 


(A6.4) Remark. If H is any Lie group, then conjugation by any element h 
of H is an automorphism. That is, if we define a map yy: H — H by 
yn(x) = h-'axh, then yp is a continuous automorphism of H. Any such 
automorphism is said to be “tnner? The group of all inner automorphisms is 
isomorphic to H/Z(H), where Z(H) is the center of H. For some groups, there 
are many other automorphisms. For example, every inner automorphism of 
an abelian group is trivial, but the automorphism group of R” is GL(n, R), 
which is quite large. In contrast, it can be shown that the group of inner 
automorphisms of G has finite index in Aut(G) (since G is semisimple). 


(A6.5) Definitions. 
1) For A, B € Mate,.¢(R), the commutator (or Lie bracket) of A and B 
is the matrix [A, B] = AB — BA. 
2) A vector subspace § of Matey¢(R) is a Lie algebra if it is closed under 
the Lie bracket. That is, for all A,B € 6, we have [A, B] € . 


3) A map p from one Lie algebra to another is a homomorphism if 
e it is a linear transformation, and 
e it preserves brackets (that is, [o(A), e(B)] = p([A, B))). 

4) Suppose H is a closed subgroup of GL(¢,R). Then H is a C° mani- 
fold, so it has a tangent space at every point; the tangent space at the 
identity element e is called the Lie algebra of H. Note that, since 
H is contained in the vector space Mate,¢(R), its Lie algebra can be 
identified with a vector subspace of Maty,.¢(R). 


(A6.6) Notation. Lie algebras are usually denoted by lowercase German 
letters: the Lie algebras of G and H are g and h, respectively. 
(A6.7) Examples. 


1) The Lie algebra sI(¢,IR) of SL(@,R) is the set of matrices whose trace 
is 0 (see Exercise 2). 


2) The Lie algebra $o(n) of SO(n) is the set of n x n skew-symmetric 
matrices of trace 0 (see Exercise 3). 


It is an important fact that the Lie algebra of H is indeed a Lie algebra: 


(A6.8) Proposition. Jf H is a closed subgroup of SL(¢,R), then the Lie 
algebra of H is closed under the Lie bracket. 


Here is a very useful reformulation: 
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(A6.9) Corollary. Suppose H; and Hp are Lie groups, with Lie algebras hy 
and 2. If Hy is a subgroup of Hz, then hy is a Lie subalgebra of he. 


Hence, for every closed subgroup of H, there is a corresponding Lie sub- 
algebra of h. Unfortunately, the converse may not be true: although every 
Lie subalgebra corresponds to a subgroup, the subgroup might not be closed. 


(A6.10) Example. The 2-torus T? = R?/Z? can be identified with the Lie 
group SO(2) x SO(2). For any line through the origin in R?, there is a cor- 
responding 1-dimensional subgroup of T?. However, if the slope of the line is 
irrational, then the corresponding subgroup of T? is dense, not closed. 


Therefore, in order to obtain a subgroup corresponding to each Lie sub- 
algebra, we need to allow subgroups that are not closed: 


(A6.11) Definition. Suppose H; and Hz are Lie groups, and p: H; > Ho is 
a homomorphism. Then p(H1) is a Lie subgroup of Ho. 


(A6.12) Proposition. If H is a Lie group with Lie algebra §, then there is a 
one-to-one correspondence between the connected Lie subgroups of H and the 
Lie subalgebras of h. 


(A6.13) Definitions. Let H be a Lie group in SL(¢,R). 


1) If h: R > G is any (continuous) homomorphism, we call h a one- 
parameter subgroup of H, and we usually write h‘, instead of h(t). 


2) We define exp: Maty,¢(R) > GL(¢,R) by 


1 
exp X = S> Pilea os 
k=0 ~ 
This is called the exponential map. 


(A6.14) Proposition. Let h be the Lie algebra of a Lie group H C SL(é,R). 
1) For any X ©), the function x' = exp(tX) is a one-parameter subgroup 
of H. 
2) Conversely, every one-parameter subgroup of H is of this form, for 
some unique X € bh. 
Furthermore, for X € Mate.(R), we have 
XeEh = VER, exp(tx) € H. 


(A6.15) Definition. Lie groups H, and Hp are locally isomorphic if there 
is a connected Lie group H and homomorphisms p;: H + H?, for 7 = 1,2, 
such that each p; is a covering map. 


(A6.16) Proposition. Two Lie groups are locally isomorphic if and only if 
their Lie algebras are isomorphic. 
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(A6.17) Notation (adjoint representation). Suppose fj is the Lie algebra of a 
closed subgroup H of SL(¢,R). For h € H and x € § C Mate,.¢(R), we define 


(Ady h)(x) = hah" € bh. 
Then Ady: H — GL(h) is a (continuous) homomorphism. It is called the 
adjoint representation of H. 


Exercises for §A6. 


#1. Suppose p: R™ — R” is a continuous map that preserves addition. 
(That is, we have p(a+y) = p(x) + p(y).) Show (without using Propo- 
sition A6.2) that p is a linear transformation (and is therefore C'°). 
This is a very special case of Proposition A6.2. 

[Hint: By assumption, we have p(kx) = kp(x) for all k € Z, so p(tx) = tp(x) 
for all t € Q (why?). Then continuity implies this is true for all t € R.] 

#2. Verify Example A6.7(1). 

[Hint: A € sl(¢,R) iff 4 det(Id +tA)|,_, = 0, and, letting A = 1/t, we have 
det(Id +¢A) = t’ det(AI +.A) = t' (" + (trace A)\o~'+--. = 14 (trace A)t+ 
senat] 

#3. Verify Example A6.7(2). 

[Hint: A matrix A of trace 0 is in $0(n) iff “(Id+tA)* (Id +tA)|,_, ='0; 
Calculate the derivative by using the Product Rule.] 


#4. In the notation of Notation A6.17, show, for all h € H, that Ady h is 
an automorphism of the Lie algebra h. (In particular, it is an invertible 
linear transformation, so it is in GL(b). 


[Hint: The map a +> hah~' is a diffeomorphism of H that fixes e, so its 
derivative is a linear transformation of the tangent space at e.| 


§A7. How to show a group is semisimple 


A semisimple group G = G, x ---G,. will often have connected, normal sub- 
groups (such as the simple factors G;). However, these normal subgroups 
cannot be abelian (see Exercise 1). The converse is a major theorem in the 
structure theory of Lie groups: 


(A7.1) Theorem. A connected Lie group H is semisimple if and only if it 
has no nontrivial, connected, abelian, normal subgroups. 


(A7.2) Remark. A connected Lie group R is solvable if every nontrivial quo- 
tient of R has a nontrivial, connected, abelian, normal subgroup. (For exam- 
ple, abelian groups are solvable.) It can be shown that every connected Lie 
group H has a unique maximal connected, closed, solvable, normal subgroup. 
This subgroup is called the radical of H, and is denoted Rad H. Our state- 
ment of Theorem A7.1 is equivalent to the more usual statement that H is 
semisimple if and only if Rad H is trivial (see Exercise 2). 
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The following result makes it easy to see that the classical groups, such 
as SL(n, R), SO(m,n), and SU(m,n), are semisimple (except a few abelian 
groups in small dimensions). 

(A7.3) Definition. A subgroup H of GL(¢,R) (or GL(¢,C)) is irreducible 
if there are no nontrivial, proper, H-invariant subspaces of R* (or C’, respec- 
tively). 
(A7.4) Example. SL(/,R) is an irreducible subgroup of SL(¢,C) (see Exer- 
cise 3). 


(A7.5) Warning. In a different context, the adjective “irreducible” can have 
a completely different meaning when it is applied to a group. For example, 
saying that a lattice is irreducible (as in Definition 4.3.1) has nothing to do 
with Definition A7.3. 


(A7.6) Remark. If H is a subgroup of GL(¢,C) that is not irreducible (that 
is, if H is reducible), then, after a change of basis, we have 
GL(k, C) * 
Cc 2 
Ho ( 0 eee 
for some k withl1<k<n-l1. 
Similarly for GL(Z, R). 
(A7.7) Corollary. If H is a nonabelian, closed, connected, irreducible sub- 
group of SL(¢,C), then H is semisimple. 


Proof. Suppose A is a connected, abelian, normal subgroup of H. For each 
function w: A > C%, let 


Vw ={v EC’ |Vae A, a(v) = w(a)v}. 
That is, a nonzero vector v belongs to Vy if 
e v is an eigenvector for every element of A, and 


e the corresponding eigenvalue for each element of a is the number that 
is specified by the function w. 
Of course, 0 € V, for every function w; let W = {w | Vy 4 O}. (This is 
called the set of weights of A on C*.) 

Each element of a has an eigenvector (because C is algebraically closed), 
and the elements of A all commute with each other, so there is a common 
eigenvector for the elements of A. Therefore, W 4 (. From the usual argu- 
ment that the eigenspaces of any linear transformation are linearly indepen- 
dent, one can show that the subspaces {V,, | w € W } are linearly indepen- 
dent. Hence, W is finite. 

For w € W and h € H, a straightforward calculation shows that hV,, = 
Vi(w), Where (h(w))(a) = w(h~tah). That is, H permutes the subspaces 
{Vwtwew- Because H is connected and W is finite, this implies hV. = Vy 
for each w; that is, V,, is an H-invariant subspace of C’. Since H is irreducible, 
we conclude that V,, = C°. 
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Now, for any a € A, the conclusion of the preceding paragraph implies 
that a(v) = w(a) v, for all v € C*. Therefore, a is a scalar matrix. 

Since deta = 1, this scalar is an @*" root of unity. So A is a subgroup 
of the group of ¢*® roots of unity, which is finite. Since A is connected, we 
conclude that A = {e}, as desired. O 


Here is another useful characterization of semisimple groups. 


(A7.8) Corollary. Let H be a closed, connected subgroup of SL(é,C). If 
e the center Z(H) is finite, and 
e H* =H (where « denotes the “adjoint? or conjugate-transpose), 


then H is semisimple. 


Proof. Because H* = H, it is not difficult to show that H is completely 
reducible: there is a direct sum decomposition C’ = Di-1 V;, such that the 
restriction H|y, is irreducible, for each j (see Exercise 6). 

Let A be a connected, normal subgroup of H. The proof of Corollary A7.7 
(omitting the final paragraph) shows that Aly, consists of scalar multiples of 
the identity, for each 7. Hence A C Z(H). Since A is connected, but (by 
assumption) Z(#) is finite, we conclude that A is trivial. O 


(A7.9) Remark. There is a converse: if G is semisimple (and connected), then 
G is conjugate to a subgroup H, such that H* = H. However, this is more 
difficult to prove. 


Exercises for §A7. 


#1. Prove (=) of Theorem A7.1. 


#2. Show that a connected Lie group H is semisimple if and only if H has 
no nontrivial, connected, solvable, normal subgroups. 


[Hint: If R is a solvable, normal subgroup of H, then [R, R] is also normal 
in H. Repeating this eventually yields an abelian, normal subgroup.| 


#3. Show that no nontrivial, proper C-subspace of C* is invariant under 
SL(é, R). 
[Hint: Suppose v,w € R*, not both 0. If they are linearly independent, then 
there exists g € SL(@,R) with g(v + iw) = v —iw. Otherwise, there exists 
nonzero \ € C with \(v + iw) € R*] 


#4. Give an example of a nonabelian, closed, connected, irreducible sub- 
group H of SL(@,R), such that AH is not semisimple. 
[Hint: U(2) is an irreducible subgroup of SO(4).| 


#5. Suppose H C SL(¢,C). Show that H is completely reducible if and 
only if, for every H-invariant subspace W of C’, there is an H-invariant 
subspace W’ of C*, such that C’=W OW’. 
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[Hint: (>) IfW’ =Vi@---@V,, and W'’NW = {0}, but (W’@V;)NW F {0} 
for every j > s, then W'+W = CC’. (<) Let W be maximal among the 
subspaces that are direct sums of irreducibles, and let V be a minimal H- 
invariant subspace of W’. Then W @ V contradicts the maximality of W.] 


#6. Suppose H = H* C SL(é,C). 
a) Show that if W is an H-invariant subspace of C’, then the orthog- 
onal complement W~+ is also H-invariant. 
b) Show that H is completely reducible. 


Notes 


See [6] for a very brief introduction to Lie groups, compatible with Defi- 
nition Al.1(3). Similar elementary approaches are taken in the books [1] and 
[3]. 

Almost all of the material in this appendix (other than §A4) can be found 
in Helgason’s book [4]. However, we do not follow Helgason’s notation for 
some of the classical groups (see Terminology A2.5). 

Theorem A4.9 is proved in [5, Thm. 8.3.2, p. 112). 

Proposition A5.4 can be found in [4, Lem. IX.7.1, p. 430]. 

See [2, Prop. 2 in $11.2 of Chapter 8, p. 166] or [7, Thm. 3.17, p. 100] for 
a proof of the Jacobson-Morosov Lemma (A5.8). 

See [9, Thm. 2.7.5, p. 71] for a proof of Proposition A6.16. 

Remark A7.9 is due to Mostow [8]. 
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Appendix B 
Assumed Background 


Since the target audience of this book includes mathematicians from a variety 
of backgrounds (and because very different theorems sometimes have names 
that are similar, or even identical), this chapter lists (without proof or dis- 
cussion) specific notations, definitions, and theorems of graduate-level math- 
ematics that are assumed in the main text. (Undergraduate-level concepts, 
such as the definitions of groups, metric spaces, and continuous functions, are 
generally not included.) All of this material is standard, so proofs can be 
found in graduate textbooks (and on the internet). 


Bl. Groups and group actions 
8 p group 
(B1.1) Notation. Let H be a group, and let K be a subgroup. 


1) We usually use e to denote the identity element. 

2) Z(H) ={z¢€H | hz=zh for all h € H } is the center of H. 

3) Co(K) = {h © H | hk =kh for all k € K } is the centralizer of kK 

in A. 

4) No(K)={heH |hKh7!=K } is the normalizer of K in H. 
(B1.2) Definition. An action of a Lie group H on a topological space X is 
a continuous function a: H x X — X, such that 

e a(e,xz) = x for all x € X, and 
e a(g,a(h,z)) =a(gh, x) for g,hée H andre X. 


(B1.3) Definitions. Let a (discrete) group A act on a topological space M. 


1) The action is free if no nonidentity element of A has a fixed point. 


Recall: The Standing Assumptions (4.0.0 on page 43) are in effect, so, as 
always, Tis a lattice in the semisimple Lie group G C SL(@, R). 

You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 
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2) It is properly discontinuous if, for every compact subset C' of M, 
the set {A € A| CN (AC) £9} is finite. 
3) For any p € M, we define Staba(p) = {A € A | Ap = p}. This is a 
subgroup of A called the stabilizer of p in A. 
4) M is connected if it is not the union of two nonempty, disjoint, proper, 
open subsets. 


5) M is locally connected if every neighborhood of every p € M contains 
a connected neighborhood of p. 


(B1.4) Proposition. Jf A acts freely and properly discontinuously on a topo- 
logical space M, then the natural map 7: M — A\M is a covering map. 
Under the simplifying assumption that M is locally connected, this means 
that every p € A\M has a connected neighborhood U, such that the restriction 
of m to each connected component of r~1(U) is a homeomorphism onto U. 


§B2. Galois theory and field extensions 


(B2.1) Theorem (Fundamental Theorem of Algebra). The field C of complex 
numbers is algebraically closed; that is, every nonconstant polynomial f(x) € 
C[z] has a root in C. 


(B2.2) Proposition. Let F' be a subfield of C, and let a: F + C be any 
embedding. Then o extends to an automorphism @ of C. 


(B2.3) Notation. If F is a subfield of a field L, then |Z: F'| denotes dime L, 
the dimension of L as a vector space over F’. This is called the degree of L 
over F’. 


(B2.4) Proposition. [f F and L are subfields of C, such that F C L, then 
|L: F'| is equal to the number of embeddings o of L in C, such that o|p = Id. 


(B2.5) Definition. An extension L of a field F’ (of characteristic zero) is 
Galois if, for every irreducible polynomial f(x) € Fx], such that f(x) has a 
root in L, there exist aj,...,Q@n € L, such that 


f(a) = (@ — a1) ++: (@— an). 
That is, if an irreducible polynomial in F'[z] has a root in L, then all of its 
roots are in L. 


(B2.6) Definition. Let L be a Galois extension of a field F’. Then 
Gal(L/F) = {o € Aut(L) | o|r =Id}. 
This is the Galois group of L over F. 


(B2.7) Proposition. Jf L is a Galois extension of a field F of characteristic 0, 
then |Gal(L/F)| = |L: F|. 
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(B2.8) Corollary. If L is a Galois extension of a field F' of characteristic 0, 
then there is a one-to-one correspondence between 


e the subfields K of L, such that F C K, and 

e the subgroups H of Gal(L/F). 
Specifically, the subgroup of Gal(L/F) corresponding to the subfield K is 
Gal(L/K). 


§B3. Algebraic numbers and transcendental numbers 


(B3.1) Definitions. 

1) A complex number z is algebraic if there is a nonzero polynomial 
f(a) € Zia], such that f(z) = 0. 

2) A complex number is transcendental if it is not algebraic. 

3) A (nonzero) polynomial is monic if its leading coefficient is 1; that is, 
we may write f(x) = 7; _ a,x" with a, = 1. 

4) A complex number z is an algebraic integer if there is a monic poly- 
nomial f(#) € Z{x], such that f(z) = 0. 


(B3.2) Proposition. If a is an algebraic number, then there is some nonzero 
m € Z, such that ma is an algebraic integer. 


(B3.3) Proposition. The set of algebraic integers is a subring of C. 
(B3.4) Proposition. Fiz some n € N*. Let 


e w be a primitive n” root of unity, and 
e Z* be the multiplicative group of units modulo n. 
Then there is an isomorphism 
f: ZX + Gal(Qlu]/Q): k 4 fe, 
such that fx(w) =w*, for all k € ZX. 


§B4. Polynomial rings 
(B4.1) Definition. A commutative ring R is Noetherian if the following 
equivalent conditions hold: 
1) Every ideal of R is finitely generated. 
2) If I, C Ig C --- is any increasing chain of ideals of R, then there is 
some #2, Such that: J, = 14e1 St nse = 


(B4.2) Proposition (Hilbert Basis Theorem). For any field F’, the polynomial 


ring F'la1,...,%5] (in any number of variables) is Noetherian. 
(B4.3) Theorem. Let F' be a subfield of a field L. If L is finitely gen- 
erated as an F-algebra (that is, if there exist c1,...,cr € L, such that 


L=Flci,...,c,]), then L is algebraic over F. 


450 B. ASSUMED BACKGROUND 


(B4.4) Proposition (Nullstellensatz). Let 

e F be an algebraically closed field, 

e Flx,,...,x,] be a polynomial ring over F’, and 

e I be any proper ideal of F[x,...,2,]. 
Then there exist ay,...,a, € F, such that f(ai,...,a,) =0 forall f(x1,...,2,) € 
I. 


(B4.5) Corollary. If B is any finitely generated subring of C, then there is 
a nontrivial homomorphism from B to the algebraic closure Q of Q. 


(B4.6) Lemma (Eisenstein Criterion). Let f(x) € Zax]. If there is a prime 
number p, and some a € Zy \ {0}, such that 

e f(x) =ax” (mod p), where n = deg f(x), and 

e (0) £0 (mod p’), 
then f(x) is irreducible over Q. 


§B5. General topology 
(B5.1) Definitions. Let X be a topological space. 


1) A subset C of X is precompact (or relatively compact) if the closure 
of C is compact. 

2) X is locally compact if every point of X is contained in a precompact, 
open subset. 

3) X is separable if it has a countable, dense subset. 

4) If J is an index set (of any cardinality), and X; is a topological space, for 
each i € J, then the Cartesian product Xj;¢7; X; has a natural “product 
topology, in which a set is open if and only if it is a union (possibly 
infinite) of sets of the form Xj;¢;U;, where each U; is an open subset 
of X;, and we have U; = X; for all but finitely many 7. 


(B5.2) Theorem (Tychonoff’s Theorem). If X; is a compact topological space, 
for each i € I, then the Cartesian product X;c7 X; is also compact (with 
respect to the product topology). 


(B5.3) Proposition (Zorn’s Lemma). Suppose < is a binary relation on a 
set P, such that: 

e Ifa<bandb<c, thena<c. 

e Ifa<bandb<a, thena=b. 

ea<a for alla. 


e IfC CP, such that, for all ci,cg € C, either c, < cg or c2 < C1, then 
there exists b€ P, such that c <b, for allc EC. 


Then there exists a € P, such thata £ b, for allbe P. 
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§B6. Measure theory 


(B6.1) Assumption. Throughout this section, X and Y are complete, sep- 
arable metric spaces. (Recall that complete means all Cauchy sequences 
converge.) 
(B6.2) Definitions. 
1) The Borel o-algebra B(X) of X is the smallest collection of subsets 
of X that: 
e contains every open set, 
e is closed under countable unions (that is, if 41, Ao,... € B, then 
U2, Ai € B), and 
e is closed under complements (that is, if A € B, then X \ A € B). 
2) Each element of B(X) is called a Borel set. 
3) A function f: X — Y is Borel measurable if f~1(A) is a Borel set 
in X, for every Borel set Ain Y. 


4) A function w: B(X) — [0,00] is called a measure if it is countably 
additive. This means that if A,, Ao,... are pairwise disjoint, then 


lu (U 4 = > HAs) 


5) A measure yp on X is Radon if u(C’) < oo, for every compact subset C 
of X. 


6) A measure 4 on X is o-finite if X is the union of countably many 
sets of finite measure. This means X = U)~, Aj, with p(A;) < oo for 
each 2. 


(B6.3) Proposition. Jf u is a measure on X, and f is a measurable function 
on X, such that f > 0, then the integral i f du is a well-defined element of 
[0, co], such that: 


1) fy Xadu = pA) if X4 is the characteristic function of A. 
2) Sy larhi + a2 fo) du = ay ie fi dp + az ie fo du for ay,a2z € [0, 00). 


3) if {fn} is a sequence of measurable functions on X, such that we have 
O<fisfes::-, then 


[i im to an im, [fod 
(B6.4) Corollary (Fatou’s Lemma). Jf {f,}°2, is a sequence of measurable 
functions on X, with fr > 0 for all n, and pp is a measure on X, then 


i liminf f, du < lim int f Sic its 
x x 


nN—-oo nN oo 
(B6.5) Proposition. If X is locally compact and separable, then every Radon 
measure jt on X is inner regular. This means 


(EF) = sup{ u(C) | C is a compact subset of FE}, for every Borel set E. 
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(B6.6) Proposition (Lusin’s Theorem). Assume p is a Radon measure on X, 
and X is locally compact. Then, for every measurable function f: X —> R, 
and every € > 0, there is a continuous function g: X > R, such that 


w({weX | f(x) #9(e)} <e 


(B6.7) Definition. If 4: is a measure on X, and f: X — Y is measurable, 
then the push-forward of yu is the measure f,j4 on Y that is defined by 


(faus)(A) = p(f-"(A)) for ACY. 
(B6.8) Proposition (Fubini’s Theorem). Assume 


e X, and X2 are complete, separable metric spaces, and 
e pi; is ao-finite measure on X;, fori = 1,2. 
Then there is a measure v = [11 X 2 on X1 X X2, such that: 
1) v(E, x Eg) = v(E\)-v(E2) when E; is a Borel subset of X; fori = 1,2, 
and 
2) eae fdv= a fon f (x1, ©2) dz (x2) dus (x1) when the function f: X1x 
X2 — [0,00] is Borel measurable. 


(In particular, Ge f (x1, %2) du2(x2) is a measurable function of x1.) 


(B6.9) Definitions. 


1) The support of a function f: X — C is defined to be the closure of 
{ce X | f(r) #0}. 
2) C.(X) = {continuous functions f: X — C with compact support }. 
3) A: C.(X) > Cis a positive linear functional on C.(X) if: 
e it is linear (that is, A(a1 f1+a2f2) = a1A(f1)+a2A(f2) for a1, a2 € C 
and fi, fe € C(X)), and 
e it is positive (that is, if f(a) > 0 for all x, then A(f) > 0). 


(B6.10) Theorem (Riesz Representation Theorem). Assume X is locally 
compact and separable. If \ is any positive linear functional on C.(X), then 
there is a Radon measure on X, such that 


MA = f fan for all f € C(X). 


(B6.11) Definitions. Assume pz is a measure on X, and the function py: X > 
C is measurable. 


1) For 1 < p< ov, the L?-norm of ¢ is 


l= (ff iot)Panee)) 


2) An assertion P(x) is said to be true for almost all x € X (or to 
be true almost everywhere, which is usually abbreviated to a.e.), if 
u({ x | P(2) is false }) = 0. 
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3) In particular, two functions y1 and 2 are equal (a.e.) if 


u({x | p1(2) F po(z) }) = 0. 
This defines an equivalence relation on the set of (measurable) functions 
on X. 
4) The £°-norm (or essential supremum) of ¢ is 
I|¥lloo = min{ a € (—00, oo] | v(x) < a for ae. x}. 

5) LAX, ph) = {~: X > C| |lvllp < co}, for 1 < p < co. An element 
of £°(X, 4) is called an L?-function on X. Actually, two functions in 
LY(X, 1) are identified if they are equal almost everywhere, so, tech- 
nically, £?(X, 4) should be defined to be a set of equivalence classes, 
instead of a set of functions. 


(B6.12) Definition. Two measures 1, and v on X are in the same measure 
class if they have exactly the same sets of measure 0: 

u(A)=0 — > v(A)=0. 
(This defines an equivalence relation.) 
(B6.13) Theorem (Radon-Nikodym Theorem). Two o-finite measures wu 
andv on X are in the same class if and only if there 1s a measurable function 


D: X +R, such that p = Dv. That is, for every measurable subset A of X, 
we have (A) = J, Ddv. 


The function D is called the Radon-Nikodym derivative d/dv. 


§B7. Functional analysis 


(B7.1) Definitions. Let F be either R or C, and let V be a vector space 
over F. 


1) A topological vector space is a vector space V, with a topology, 
such that the operations of scalar multiplication and vector addition 
are continuous (that is, the natural maps F x V > V and V x V > V 
are continuous). 


2) A subset C of V is convex if, for all v,w € C and 0 < t < 1, we have 
tu+(1—thwec. 

3) A topological vector space V is locally convex if every neighborhood 
of 0 contains a convex neighborhood of 0. 


4) A locally convex topological vector space V is Frchet if its topology 
can be given by a metric that is complete (that is, such that every 
Cauchy sequence converges to a limit point). 

5) A norm on V is a function || ||: V — [0, 00), such that: 

(a) |ju + w| < |lv|| + ||w|| for all v,w € V, 
(b) |lav|| = Ja] |u|] for a € F and v € V, and 
(c) |u|] = 0 if and only if v =0. 
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Note that any norm || || on V provides a metric that is defined by 
d(v,w) = ||v — w||. Thus, the norm determines a topology on V. 
A Banach space is a vector space 8, together with a norm |] ||, such 


that the resulting metric is complete. (Banach spaces are Frchet.) 


An inner product on V is a function (|): V x V > F, such that 
(a) (av + bw | x) = a(v|x) + b(w|x) for a,b € F and v,w,2 € V, 
(b) (v | w) = (w|v) for v,w € V, where @ denotes the complex 
conjugate of a, and 
(c) (v|v) > 0 for all v € V, with equality iff v = 0. 
Note that if (|) is an inner product on V, then a norm on V is defined 


by the formula ||v|] = ./(v | v). 


A Hilbert space is a vector space H, together with an inner product 
(|), such that the resulting normed vector space is complete. (Hence, 
every Hilbert space is a Banach space.) 


An isomorphism between Hilbert spaces (H1, (|)1) and (He, (| )2) is 
an invertible linear transformation T': H, — H2, such that 


(Tu | Tw). = (v|w), for all v,w € Hy. 


An isomorphism from H to itself is called a unitary operator on H. 


(B7.2) Example. If yw is a measure on X, then the £?norm makes £°(X, 1) 
into a Banach space (for 1 < p < oo). Furthermore, £°(X,) is a Hilbert 
space, with the inner product 


(p |b) = [ y(e) B@) dul). 


(B7.3) Definitions. Let B be a Banach space (over F € {R, C}). 


1) 


A continuous linear functional on B is a continuous function 
A: B — F that is linear (which means A(av + bw) = aXd(v) + bA(w) 
for a,b € F and v,w € B). 


b* = {continuous linear functionals on 6 } is the dual of B. This is a 
Banach space: the norm of a linear functional is 

|All = sup{]A(v)| | v € B, lull < Tf. 
For each v € B, there is a linear function e,: B* — F, defined by 
€y(A) = A(v). The weak* topology on B* is the coarsest topology for 
which every e, is continuous. 

In other words, the basic open sets in the weak* topology are of 
the form { \ € B* | A(v) € U }, for some v € B and some open subset U 
of F. A set in 6* is open if and only if it is a union of sets that are 
finite intersections of basic open sets. 


Any continuous, linear transformation from B to itself is called a 
bounded operator on B. 
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5) The set of bounded operators on B is itself a Banach space, with the 
operator norm 


TI] = supt 7) | lel] < 1F- 


(B7.4) Proposition (Banach-Alaoglu Theorem). If B is any Banach space, 
then the closed unit ball in B* 1s compact in the weak* topology. 


(B7.5) Proposition (Hahn-Banach Theorem). Suppose 


e B is a Banach space over F, 


e W is a subspace of B (not necessarily closed), and 


e \:W > F its linear. 
Tf |XA(w)| < ||w|] for all w € W, then X extends to a linear functional A: BO 
F, such that |A(v)| < |lv|| for all u € B. 


(B7.6) Proposition (Open Mapping Theorem). Assume X and Y are Frchet 
spaces, and f: X + Y is a continuous, linear map. 


1) If f is surjective, and O is any open subset of X, then f(O) is open. 
2) If f is bijective, then the inverse f~': Y 4 X is continuous. 


(B7.7) Assumption. Hilbert spaces are always assumed to be separable. 
This has the following consequence: 


(B7.8) Proposition. There is only one infinite-dimensional Hilbert space (up 
to isomorphism). In other words, every infinite-dimensional Hilbert space is 
isomorphic to £°(R, 4), where pu is Lebesgue measure. 


(B7.9) Definitions. 


1) If Hy and Hz are Hilbert spaces, then the direct sum H, © H2 is a 
Hilbert space, under the inner product 


((p1, 92) | (di, b2)) = (er | 1) + (2 | 2). 
By induction, this determines the direct sum of any finite number of 
Hilbert spaces; see Definition 11.6.1 for the direct sum of infinitely 
many. 

2) We use “L” as an abbreviation for “is orthogonal to” Therefore, if 
y,w EH, then y | w means (vy | ) = 0. For subspaces K, K’ of H, we 
write K 1 K’ ify 1 vy’ for all p € K and y’ EK’. 

3) The orthogonal complement of a subspace K of H is 

K-={@eH | ol Kk}. 
This is a closed subspace of H. We have H =K+K+ and K 1 K+, so 
H=KeKt. 

4) The orthogonal projection onto a closed subspace K of H is the 

(unique) bounded operator P: H — K, such that 
e P(y) =¢ for all y € K, and 
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e P(w) =0 for all pe K+. 


(B7.10) Definitions. Let T: # — H be a bounded operator on a Hilbert 
space H. 


1) The adjoint of T is the bounded operator T* on H, such that 
(Te |b) = (p | T*p) for all yb € H. 

It does not always exist, but 7* is unique if it does exist. 

2) T is self-adjoint (or Hermitian) if T = T*. 

3) T is normal if TT* = T*T. 

4) T is compact if there is a nonempty, open subset O of H, such that 
T(O ) is precompact. 

(B7.11) Proposition. Let T be a bounded operator on a Hilbert space H. 

1) If T(H) is finite-dimensional, then T is compact. 

2) The set of compact operators on H. is closed (in the topology defined by 
the operator norm). 


(B7.12) Proposition (Spectral Theorem). Jf T is any bounded, normal op- 
erator on any Hilbert space H, then there exist 


e a finite measure yu on [0,1], 
e a bounded, measurable function f: [0,1] > C, and 
e an isomorphism U: H > £7((0, 1], w), 


such that U(Ty) = f U(y), for ally © H (where f U(y) denotes the pointwise 
multiplication of the functions f and U(y). 
Furthermore: 


1) T is unitary if and only if |f(x)| = 1 for a.e. x € [0,1]. 
2) T is self-adjoint if and only if f(a) € R for a.e. x € [0,1]. 


(B7.13) Definition. In the situation of Proposition B7.12, the spectral mea- 
sure of T is f,p. 


(B7.14) Corollary (Spectral Theorem for compact, self-adjoint operators). 
Let T be a bounded operator on any Hilbert space H. Then T is both self- 
adjoint and compact if and only if there exists an orthonormal basis {e,} 
of H, such that 


1) each en is an eigenvector of T, with eigenvalue Xn, 
2) An € R, and 
By lint 65 Ag =O: 


(B7.15) Proposition (Frchet-Riesz Theorem). Jf X is any continuous linear 
functional on a Hilbert space H, then there exists wy € H, such that A(p) = 
(p|w) for all p EH. 


Appendix C 
A Quick Look at S-Arithmetic Groups 


Classically, and in the main text of this book, the Lie groups under consider- 
ation were manifolds over the field R of real numbers. However, in some areas 
of modern mathematics, especially Number Theory and Geometric Group 
Theory, it is important to understand the lattices in Lie groups not only over 
the classical field R (or C), but also over “nonarchimedean” fields of p-adic 
numbers. The natural analogues of arithmetic groups in this setting are called 
“S-arithmetic groups” Roughly speaking, this generalization is obtained by 
replacing the ring Z with a slightly larger ring. 


(C0.1) Definition. For any finite set S = {pi,po2,...,Pn} of prime numbers, 
let 
Pp every prime factor 
Lg =4 = _ 
. {Zee of gisin S 


This is called the ring of S-integers. 


} = Z|1/p1,1/p2,.--,1/pn]. 


(CO.2) Example. 
1) The prototypical example of an arithmetic group is SL(@, Z). 


2) The corresponding example of an S-arithmetic group is SL(é, Zs) 
(where S is a finite set of prime numbers). 


That is, while arithmetic groups do not allow their matrix entries to have 
denominators, S-arithmetic groups allow their matrix entries to have denom- 
inators that are products of certain specified primes. 


Most of the results in this book can be generalized in a natural way to 
S-arithmetic groups. (The monographs [5] and [8] treat S-arithmetic groups 
alongside arithmetic groups throughout.) We will now give a very brief de- 
scription of these more general results. 


You can copy, modify, and distribute this work, even for commercial purposes, all without 
asking permission. http: //creativecommons.org/publicdomain/zero/1.0/ 
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(C0.3) Remark. The monograph [5] of Margulis deals with a more general 
class of S-arithmetic groups that allows R to be replaced with certain “local” 
fields of characteristic p, but we discuss only the fields of characteristic 0. 


§C1. Introduction to S-arithmetic groups 


Most of the theory in this book (and much of the importance of the theory of 
arithmetic groups) arises from the fundamental fact that G'z is a lattice in G. 
Since the ring Zs is not discrete (unless S = 0), the group Gz, is usually 
not discrete, so it is usually not a lattice in G. Instead, it is a lattice in a 
group Gg that will be defined in this section. 

The construction of R as the completion of Q can be generalized as follows: 


(C1.1) Definition (p-adic numbers). Let p be a prime number. 


1) If x is any nonzero rational number, then there is a unique integer 
U = v,(x), such that we may write 
— mv a 
t=pPp b? 
where a and 0 are relatively prime to p. (We let v,(0) = oo.) Then 
Up(x) is called the p-adic valuation of x. 


2) Let 
dag) =—p ee, 


It is easy to verify that d, is a metric on Q. It is called the p-adic 
metric. 


3) Let Q, be the completion of Q with respect to this metric. (That is, Q, 
is the set of equivalence classes of convergent Cauchy sequences.) This 
is a field that naturally contains Q. It is called the field of p-adic 
numbers. 


4) If G is an algebraic group over Q, we can define the group G(Q,) of 
Q,-points of G. 


(C1.2) Notation. To discuss real numbers and p-adic numbers uniformly, it 
is helpful to let Q, = R. 


The construction of arithmetic subgroups by restriction of scalars (see Sec- 
tion 5.5) is based on the fact that the ring O of integers in a number field F’ 
embeds as a cocompact, discrete subring in @,,< s,, fv. Using this fact, it 
was shown that G(Q) is a lattice in Xyes,, G(Fi). 

Similarly, to obtain a lattice in a p-adic group G(Z,), or, more generally, 
in a product X yesu{co} Fu of p-adic groups and real groups, we note that 


Zs embeds as a cocompact, discrete subring in Se Qp- 
pESU{oo} 
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Using this fact, it can be shown that 


G(Os) isa latticeinGs= x  G(Q,). 
pESU{oco} 


We call G(Og) an S-arithmetic subgroup. 


(C1.3) Example. Let G be the special linear group SL. 
1) Letting S = Q, we have Zg = Z and Gg = SL(n,R). So SL(n, Z) is an 
S-arithmetic lattice in SL(n,R). This is a special case of the fact that 
every arithmetic lattice is an S-arithmetic lattice (with S = 0). 


2) Letting S = {p}, where p is a prime, we see that SL(n, Z[1/p]) is a 
lattice in SL(n, R) x SL(n, Q,). 
3) More generally, letting S = {pi,p2,...,pr}, where p1,...,p, are primes, 
we see that SL(n, Zs) is a lattice in 
SL(n, R) x SL(n, Qp,) x SL(n, Qp,) x +--+ x SL(n, Q,,). 
This is an elaboration of our previous comment that SL(@, Zs) is the proto- 
typical example of an S-arithmetic group (see Example C0.2). 


(C1.4) Remark [2, Chap. 7]. In the study of arithmetic subgroups of a Lie 
group G, the symmetric space G/K is a very important tool. In the theory 
of S-arithmetic subgroups of Gg, this role is taken over by a space called the 
Bruhat-Tits building of Gg. It is a Cartesian product 


Xs =(G/K)x X Xp, 
pes 


where X, is a contractible simplicial complex on which G(Q,,) acts properly 
(but not transitively). 


Optional: Readers familiar with the basic facts of Algebraic Number 
Theory will realize that the above discussion has the following natural gener- 
alization: 

(C1.5) Definition ([5, p. 61], [8, p. 267]). Let 

e O be the ring of integers of an algebraic number field F, 

e S be a finite set of finite places of F’, and 

e G be a semisimple algebraic group over F’, and 

e Gg = Xvesus.. G(Fr). 

Then G(Oz) is an S-arithmetic subgroup of Gs. 


(C1.6) Remark. More generally, much as in Definition 5.1.19, if 
e I” is an S-arithmetic subgroup of G4, and 
e y: Go > Gz is a surjective, continuous homomorphism, with compact 
kernel, 
then any subgroup of Gg that is commensurable to y(I’) may be called an 
S-arithmetic subgroup of Gg. 
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(C1.7) Theorem [8, Thm. 5.7, p. 268]. Every S-arithmetic subgroup of Gs 
is a lattice in Gg. 


§C2. List of results on S-arithmetic groups 


(C2.1) Warning. The Standing Assumptions (4.0.0) do not apply in this 
appendix, because G is not assumed to be a real Lie group. 


Instead: 


(C2.2) Assumption. Throughout the remainder of this appendix: 
e G is a semisimple algebraic group over Q, 
e S is a finite set of prime numbers, and 
e Tis an S-arithmetic lattice in Gg = X pesu{oo} G(Qp). 


To avoid trivialities, we assume Gg is not compact, so I is infinite. 
(C2.3) Definition. As a substitute for real rank in this setting, let 


S-rankG = S- rankg, G. 
pESU{co} 


(C2.4) Remark. All of these results generalize to the setting of Definition C1.5, 
but we restrict our discussion to Q for simplicity. 


The following theorems on S-arithmetic groups are all stated without 
proof, but each result is provided with a reference for further reading. The 
reader should be aware that these references are almost always secondary 
sources, not the original appearance of the result in the literature. 


Results related to Chapter 4 (Basic Properties of Lattices). 

(4.4.35) [\Gs is compact if and only if the identity element e is not an 
accumulation point of Ts [9, Thm. 1.12, p. 22]. 

(4.4.45) If [ has a nontrivial, unipotent element, then I\Gg is not compact. 
In fact, Godement’s Criterion (5.3.1) tells us that the converse is also 
true. 

(4.5.15) The Borel Density Theorem holds, for any continuous homomor- 
phism p: Gs — GL(V), where V is a vector space over R, C, or any 
p-adic field Q, [5, Thm. IT.2.5 (and Lem. II.2.3), p. 84]. 

(4.7.105) I is finitely presented [8, Thm. 5.11, p. 272]. 

(4.8.25) (Selberg Lemma) I has a torsion-free subgroup of finite index |9, 
Thm. 6.11, p. 93]. 


(4.9.25) (Tits Alternative) [T has a nonabelian free subgroup [5, App. B, 
pp. 351-353]. 
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Results related to Chapter 5 (What is an Arithmetic Group?). 


(5.2.15) (Margulis Arithmeticity Theorem) If S-rankG > 2, then every ir- 
reducible lattice in Gg is S-arithmetic [5, Thm. IX.1.10, p. 298, and 
Rem. (vi) on p. 290]. (Note that our definition of irreducibility is stronger 
than the one used in [5].) 


(5.3.15) (Godement Criterion) [\Ggs is compact if and only if [ has no non- 
trivial unipotent elements [8, Thm. 5.7(2), p. 268}. 


Remark. Exercise 4.4410 (easily) implies: 


1) {13, Thm. 1] If v is any nonarchimedean place of F’, then every lattice 
in G(F,,) is cocompact. 


2) If G(S..) is compact, then every lattice in G is cocompact. 


Warning. We know that if O is the ring of integers of F’, then G(O) embeds 
as an arithmetic lattice in Xyes,, G(F,), but that restriction of scalars allows 
us to realize this same lattice as the Z-points of an algebraic group defined 
over Q (cf. Proposition 5.5.8). This means that all arithmetic groups can 
be found by using only algebraic groups that are defined over Q, not other 
number fields. It is important to realize that the same cannot be said for 
S-arithmetic groups: most extensions of Q provide many S-arithmetic groups 
that cannot be obtained from Q. 

For example, suppose p is a prime in Z, but p factors in the integers O 
of an extension field, and a is a prime factor of p in O. Then the subgroup 
SL(2, O[1/p]) can be obtained by restriction of scalars, but SL(2, O[1/a}) is 
an {a}-arithmetic subgroup that cannot be obtained by this method. 


A result related to Chapter 12 (Amenable Groups). 


(12.4.55) For v € S, if G(F,) is not compact, then G(F),) is not amenable 
[12, Rem. 8.7.11, p. 260]. 


Results related to Chapter 13 (Kazhdan’s Property (T)). 


(13.2.45) If rankp,G > 2, for every simple factor G of G(F,,), and every 
v € S, then Gg has Kazhdan’s property [5, Cor. III.5.4, p. 130). 


(13.4.1) If Gg has Kazhdan’s property, then I also has Kazhdan’s property 
[5, Thm. IIL.2.12, p. 117]. 


(13.4.35) IfT has Kazhdan’s property, then I'/|I, I] is finite [5, Thm. H1.2.5, 
p. 115]. 
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A result related to Chapter 16 (Margulis Superrigidity Theo- 
rem). 
Assumption. Assume 

e S-rankG > 2, 

e I is irreducible, and 


e w is a place of some algebraic number field F’. 


(16.1.65) (Margulis Superrigidity Theorem [5, Prop. VII.5.3, p. 225) If 
e G’ is a Zariski-connected, noncompact, simple algebraic group 
over F’,, with trivial center, and 
e »: T > G’(F’,) is a homomorphism, such that y(L) is: 
o Zariski dense in G’, and 
o not contained in any compact subgroup of G’(F7/,), 
then y extends to a continuous homomorphism ¢: Gs > G’(FY,). 
Furthermore, there is some v € S, such that F), is isomorphic to a 
subfield of a finite extension of FY. 


Warning. Exercise 16.441 does not extend to the setting of S-arithmetic 
groups: for example, the lattice SL(n, Z) is not cocompact, but the image of 
the natural inclusion SL(n, Z) — SL(n, Q,) is precompact. 


Results related to Chapter 17 (Normal Subgroups of LI). 


(17.1.1s5) (Margulis Normal Subgroups Theorem [5, Thm. VIII.2.6, p. 265]) 
Assume 


e S-rank G > 2, 

e IT is irreducible, and 

e N isa normal subgroup of I. 
Then either N is finite, or ['/N is finite. 


(17.2.15) If S-rank G = 1, then T has (many) normal subgroups N, such that 
neither N nor ['/N is finite [4, Cor. 7.6]. 


Results related to Chapter 18 (Arithmetic Subgroups of Classi- 
cal Groups). 

(18.1.15) Let Q, be the algebraic closure of Q,. Then all but finitely many 
of the simple Lie groups over Q» are isogenous to either SL(n, Q»); 
SO(n, Q,), or Sp(2n,Q,), for some n [3, Thm. 11.4, pp. 57-58, and 
Thm. 18.4, p. 101]. 

(18.1.75), (18.5.39) Every Q-form or Q,-form of SL(n,Q,), SO(n, Q,), or 
Sp(n, Q,) is of classical type, except for some “triality” forms of SO(8, Q,) 
(cf. Remark 18.5.10) [8, §2.3]. 
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(18.5.105) Unlike R, the field Q, has extensions of degree 3, so some Q,,-forms 
of SO(8, Q,) are triality groups, even though there are no such R-forms 
of SO(8, C). 

(18.7.15) Gg has a cocompact, S-arithmetic lattice [1]. 

(18.7.45) If G is isotypic, then Gg has a cocompact, irreducible lattice that 
is S-arithmetic [1]. 

(18.7.55) If Gg has an irreducible, S-arithmetic lattice, then G is isotypic. 


A result related to Chapter 19 (Construction of a Coarse Fun- 
damental Domain). If F is any coarse fundamental domain for G(Z) in 
G(R), then there is a compact subset C of X pes G(Q,), such that F x C is 
a coarse fundamental domain for G(Zg) in Gg [8, Prop. 5.11, p. 267]. 

This implies that every S-arithmetic subgroup of Gg is a lattice [8, 
Thm. 5.7, p. 268], but the short proof outlined in Section 7.4 does not seem 
to generalize to this setting. 


Results related to Chapter 20 (Ratner’s Theorems on Unipotent 
Flows). Ratner’s three main theorems (20.1.3, 20.3.3, and 20.3.4) have all 
been generalized to the S-arithmetic setting by Ratner [10, 11] and Margulis- 
Tomanov [6, 7] (independently). 
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Cu(K) = centralizer of K in H, page 447 

Nu(K) = normalizer of K in H, page 447 

Stab,(p) = stabilizer of p, page 448 

|L: F| = degree of field extension, page 448 

Gal(L/F') = Galois group, page 448 

B(X) = {Borel subsets of X}, page 451 

f.(4 = push-forward of measure jz, page 452 

C.(X) = {continuous functions with compact support}, page 452 
LX, ) = {L"functions on X}, page 453 

B =a Banach space, page 454 

H =a Hilbert space, page 454 

B* = dual of B, page 454 

||Z'|| = operator norm, page 455 

1 = “is orthogonal to”, page 455 

K+ = orthogonal complement of the subspace K, page 455 


Appendix C. A Quick Look at S-Arithmetic Groups 

S =a finite set of prime numbers, page 457 

Tig Z(1/p1, ik eee L/D, | is the ring of S-integers, page 457 
Q» = field of p-adic numbers, page 458 

Qo = R, page 458 

Gg = X pesu{oco} G(Qp), page 459 

S-rank G = Dd pesuites} rankg, G, page 460 


a-T-menable, see Haagerup property 


a.e., 448 
action of a group, 443 
adjoint 
of a linear transformation, 452 
representation, 438 
affine isometry, 268 
algebraic 
group 
over Q, 84 
over R, 430 
real, 430 
integer, 445 
number, 445 
number field, 98 
almost 
all, 448 
everywhere, 448 
almost-invariant vector, 259 
amenable group, 229, 230, 289 
anti-involution, 145 
standard, 146 
arithmetic 
hyperbolic manifold, 130 
subgroup, 16, 83, 88 
asymptotic cone, 24 


badly approximable, 407 
Banach 

A-module, 253 

space, 450 
bilinear form, 112 
Borel 

function, 447 

set, 447 

o-algebra, 447 

subgroup, 187 
Borel-Serre compactification, 66 
bounded operator, 450 
Bruhat-Tits building, 455 


Cartan 
decomposition, 217 
involution, 395 

Cayley 
graph, 203 
plane, 183 

center 
of a group, 443 
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of mass, 240 
centralizer, 443 
character (of an abelian group), 221 
circumcenter, 272 
classical 

group, 425 

type, 373 
coarse 

geodesic, 205 

triangle, 207 
coarsely geodesic, 205 
coboundary, 366 
cocompact subgroup, 29, 46 
cocycle, 366 

cohomologous, 366 
cogrowth of a discrete group, 252 
cohomology (1st), 366 
commensurable, 48, 431 

abstractly, 50 
commensurator, 49, 91 
commutator, 436 
compact 

convex H-space, 229 

linear operator, 452 
compactly generated, 67, 261 
complete metric space, 447, 449 
completely reducible, 440 
completion of F’, 101 
complexification, 51, 91, 106, 187, 360 
component, transverse, 415 
conditional measure, 291 
conformal, 308 
conjugate (of a quaternion), 116, 427 
conjugation on quaternion algebra, 146 
connected, 444 

locally, 65 
contracting homeomorphism, 73 
conull set, 284 
converge uniformly on compact sets, 11 
convergence in measure, 356 
convex set, 449 
countably additive, 447 
covering map, 444 
crossed homomorphism, 366 
cyclic, for a projection-valued measure, 

224 


defined 
over Q, 84, 96, 199 
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over R, 85 
degree 
of a field extension, 444 
of a division algebra, 144 
6-thin, 207 
dimension of a representation, 211 
direct integral, 226 
direct sum 
of Hilbert spaces, 225, 451 
of representations, 212 
discriminant (of quadratic form), 125 
divergent torus orbit, 392 
division algebra, 116, 143, 148 
central, 143 
cyclic, 145 
dual of a Banach space, 450 


elliptic element of G, 434 
energy, 340 
equivalent embeddings in C, 100 
equivariant, 218 
ergodic, 285 
component, 291 
essential supremum, 449 
essentially, 284 
H-invariant, 284 
A-equivariant, 248 
exceptional Lie group, 360, 425 
expander graphs, 262 
exponential map, 437 
extreme point, 293 


finite volume (homogeneous space), 45 
finitely 
generated, 62 
presented, 62 
fixed point, 4 
flag in R°, 335 
flat 
in a symmetric space, 21 
manifold, 11 
vector bundle, 321 
Fglner 
sequence, 238 
set, 289 
Fourier Analysis, 223 
fractal, 403, 404 
free action, 443 
Frchet space, 449 
full 
density, 297 
subgroup of a Z-lattice, 170 


INDEX 


fundamental 
domain, 44 
coarse, 63, 67, 387 
strict, 44 
set, 63 


Galois 

conjugate of G, 101 

extension, 444 

group, 444 

Theory, 366 
general position, 335 
generic element, 184 
geodesic symmetry, 6 
geodesically complete, 8 
Geometric Group Theory, 203 
geometric property, 204 
Geometric superrigidity, 341 
Gram matrix, 111 


Haagerup property, 244, 262, 278 
Haar measure, 428 
right, 428 
harmonic 
function, 251 
measure, 336 
Hermitian 
matrix, 146 
operator, 452 
Hilbert space, 289, 450 
homogeneous space, 4 
homomorphism 
of Lie algebras, 436 
of Lie groups, 435 
hybrid (of hyperbolic manifolds), 128 
hyperbolic 
element of G, 434 
Gromov, 207, 349 
group, relatively, 349, 356 
manifold, 126 
with totally geodesic boundary, 132 
plane, octonionic, 183 
space 
complex, 183 
real, 183 
surface, 311 


identity component, 4 
infinitesimal, 414 
injectivity radius, 51 
inner 
automorphism, 436 
product, 450 


interbreeding hyperbolic manifolds, 128 
intertwining operator, 212 
invariant 
measure, 284 
set, 284 
subspace, 212 
invariant function, 78 
involution, 4 
involutive isometry, 4 
irreducible 
lattice, 50, 106 
linear group, 439 
locally symmetric space, 17, 51 
representation, 213 
symmetric space, 10 
Zariski-closed subset, 433 
isogenous, 49, 424 
over Q, 199 
isogeny, 49 
isolated fixed point, 4 
isomorphic unitary representations, 212 
isomorphism 
of Hilbert spaces, 450 
of Lie groups, 435 
isotropic vector, 137, 178 
isotypic semisimple Lie group, 106 


K-finite vector, 216 
Kazhdan 

constant, 262 

group, 260 

property (T'), 66, 260 

relative, 263 

kernel of positive type, 275 
Killing form, 113 


largest 
entry, 415 
term, 414 
lattice 
in Vg, see Z-lattice 
subgroup, 15, 23, 45, 46 
least upper bound, 294 
Lebesgue 
measure, 428 
number, 54 
Lie 
algebra, 436 
bracket, 436 
group, 423 
linear, 424 
subgroup, 437 
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linear functional 
positive, 448 
continuous, 450 
locally 
compact, 446 
connected, 444 
convex, 449 
isomorphic, 49, 437 
symmetric, 13, 14, 66 
LP- 
function, 449 
norm, 448 
°-°-norm, 449 


matrix coefficient, 216 
mean, 235 
bi-invariant, 236 
measure, 447 
Borel, 283 
class, 284, 449 
finite, 284 
probability, 234 
Radon, 447 
minimal 
closed, invariant set, 337, 416 
surface, 348 
mixing, 296 
of order r, 297 
strongly, 296 
weak or weakly, 295 
modular function, 48, 429 
moduli space, 313 


nearby points, 413-414 
negative-definite, 406 
negatively curved group, 207 
net subgroup, 70 
nilpotent group, 188 
Noetherian ring, 85, 445 
noncompact type, 14 
nondegenerate 

bilinear form, 112 

quadratic form, 112 
norm, 449 

matrix, 413 

of an algebraic number, 98 
normal operator, 452 
normalizer, 415, 443 
null set, 284 


octonions, 183 
one-parameter subgroup, 437 
diagonal, 413 
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unipotent, 412 Q- 
operator norm, 451 form, 96 
order (in an algebraic number field), 98, rank, 192, 193 
144 simple group, 95 
orthogonal split, 191 
complement, 451 subgroup, 84 
group, special, 426 quadratic form, 112 
projection, 451 degenerate, 406 
quasi- 
p-adic conformal, 309 
field, 454 invariant measure, 284 
metric, 454 isometric, 204 
valuation, 454 isometry, 204 
parabolic quasisplit Q-form of G, 138 
element of G, 434 quaternion algebra, 116 
Q-subgroup, minimal, 197 
subgroup, 187 R- 
minimal, 184, 250 form, 360 
place rank, 178 
archimedean, 101 radical 
complex, 100 of a bilinear form, 275 
finite, 101 of a Lie group, 247, 438 
infinite, 101 Radon-Nikodym derivative, 449 
nonarchimedean, 101 rank 
of F’, 100 of an abelian group, 97 
p-adic, 101 over a field F’, 193 
real, 100 Q, see Q-rank 
Polish topological space, 290 R- or real, 178 
polynomial real root, 180 
monic, 445 reduced norm, 116, 426 
on a vector space, 96 reducible 
with rational coefficients, 199 lattice, 50 
Pontryagin dual, 221 linear group, 439 
Ponzi scheme on A, 254 Reduction Theory, 152 
positive reductive, 181 
definite, 10, 252, 274, 406 regular 
semi-definite, 274 measure, inner, 428, 447 
type, 273, 274 representation, 212 
conditionally, 274 relative motion, 414 
Weyl chamber, see Weyl chamber, fastest, 415 
positive relatively compact, 446 
precompact, 446 representation 
primitive vector, 164 almost, 253 
principal congruence subgroup, 68, 200 induced, 218 
product near, 253 
action, 295 regular, see regular representation 
topology, 446 trivial, 212 
proper metric space, 205 uniformly bounded, 253 
properly discontinuous, 444 unitarizable, 253 
property (F'H), 270 residually finite, 72 
proximal, 336 Restriction of Scalars, 71, 323 


push-forward of measure pu, 448 reversion anti-involution, 122, 146, 426 


Riemannian manifold, 3 


S- 

arithmetic, 455 

integers, 453 
Sanov subgroup, 77 
second countable, 234 
second-order effect, 415 
self-adjoint operator, 452 
semisimple 

element of G, 434 

Lie group, 424 
separable topological space, 446 
shearing property, 414 

in SL(2,R), 415 
Siegel set, 66, 310, 388-390 

generalized, 390 

normal, 395 

open, 391 
o-finite measure, 283, 447 
simple 

absolutely, 103 

abstract group, 423 

algebra, 148 

almost, 424 

Lie group, 423 

module, 144 

ring, 144 
simply connected, algebraically, 318 
singular set, 433 
skew field, 143 
solvable Lie group, 438 
special linear group, 425, 426 
spectral measure, 452 
stabilizer, 444 

essential, 286 
star (of finitely many rays), 24 
star-shaped neighborhood, 6 
stationary, 336 
subexponential growth, 245 
subrepresentation, 212, 266 
support, 448 
symmetric 

difference, 291 

generating set, 203 

measure, 251 

neighborhood, 6 

space, 5, 7 
symplectic group, 426 

complex, 427 

unitary, 427 
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tangent cone at infinity, see asymptotic 


cone 
thin, 6-, see 6-thin 
Tits building, 25, 394 
topological vector space, 449 
torsion free, 68 
torus, 177 

Q-split, 191 

R-split, 177 
totally 

geodesic hypersurface, 129 

isotropic, 178, 184 

real number field, 112 
transcendental number, 445 
transverse divergence, 415 
tree, 273 
triality, 359, 375, 459 
two-point homogeneous, 22 
type Fi, 66 


uniform subgroup, 46 
uniformly distributed, 288, 408 
unimodular group, 428 
unipotent 

element, 52, 434 

elementary matrix, 332 

radical, 198 

subgroup, 434 
unirational, 89, 109 
unitary 

dual, 213 

group, special, 426 

operator, 450 

representation, 211 


variety, 430 


weak* topology, 450 
weakly contained, 266 
weights of a representation, 439 
Weyl chamber, positive, 198, 388 
word 

length, 203 

metric, 203 


Z-lattice, 96 
unimodular, 164 
Zariski 
closed, 430 
closure, 430 


List of Named Theorems 


Albert-Brauer-Hasse-Noether, 148 

arithmetic subgroups are lattices, 16, 
86, 399 

Arzel-Ascoli, 11 


Banach-Alaoglu, 451 
Banach-Tarski Paradox, 246, 255 
Borel Density, 57 


Choquet’s, 293 

Commensurability Criterion for 
Arithmeticity, 91, 325 

Commensurator Superrigidity, 325 

Congruence Subgroup Property, 70, 
341, 344 


Decay of matrix coefficients, 214 


Eisenstein Criterion, 446 
Engel’s, 435 
Ergodic 
decomposition, 290 
Maximal, 289 
Mean, 289 
Pointwise, 288 
Existence of Haar Measure, 428 


Frchet-Riesz, 452 

Fubini’s, 15, 448 

Fundamental Theorem of Algebra, 444 
Furstenberg’s Lemma, 249 


Godement Compactness Criterion, 92 


Hahn-Banach, 451 
Higman-Neumann-Neumann, 349 
Hilbert Basis, 445 


Implicit Function, 311 
Iwasawa decomposition, 186 
of SL(n, R), 152 


Jacobson-Morosov, 52, 86, 435 
Jordan Decomposition, real, 434 


Kakutani-Markov Fixed Point, 231 
Krein-Milman, 293 

Kronecker’s, 328 
Kronecker-Weber, 141 


Langlands decomposition, 185, 197 
Lebesgue Differentiation, 351 
Lubotzky-Mozes-Raghunathan, 205 
Lusin’s, 80, 448 
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Mahler Compactness Criterion, 53 
Margulis 

Arithmeticity, 29, 66, 90, 323 

Normal Subgroups, 343 

Superrigidity, 307 

p-adic, 323, 325 

Values of Quadratic Forms, 406 
Martingale Convergence, 337 
Mautner phenomenon, 216 
mean proximality, 337 
Meyer’s, 121, 148, 378, 385 
Moore Ergodicity, 214, 286, 410 
Mostow Rigidity, 17, 29, 321 
weak version, 305 
Mostow Rigidity Theorem, 321 
Mostow-Prasad Rigidity, 315 
multiple recurrence, 300 


Nullstellensatz, 71, 446 


Open Mapping, 451 
Oppenheim Conjecture, 406 


Peter-Weyl, 219 
Ping-Pong Lemma, 74 
Poincaré Recurrence, 59 


Radon-Nikodym, 449 
Ratner’s 
Classification of Invariant Measures, 
409 
Equidistribution, 409 
Orbit-Closure, 402 
Reduction Theory for Arithmetic 
Groups, 390 
Riesz Representation, 448 


Schur’s Lemma, 147, 213 
Selberg’s Lemma, 68 
Skolem-Noether, 385 
Spectral, 223, 452 
Szemeredi’s, 300 


Tits Alternative, 73 
Tychonoff’s, 446 


Urysohn’s Metrization, 234 


von Neumann Selection, 292 
von Neumann’s Conjecture, 245 


Wedderburn’s, 144 


Zorn’s Lemma, 446 


